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FOREWORD 


The  Twenty-Sixth  Conference  on  the  Design  of  Experiments  (DOE)  in  Amy 
Research,  Development  and  Testing  was  held  22-24  October  1980  and  had 
as  Its  host  the  U.  S.  Army  White  Sands  Missile  Range  (WSMR).  Dr.  Richard 
H.  Duncan,  Technical  Director  and  Chief  Scientist,  WSMR,  made  many  of  the 
initial  plans  for  this  meeting.  In  particular,  he  contacted  Dr.  Harold  Law, 
Associate  Academic  Vice-President,  New  Mexico  State  University  and  made 
arrangements  with  him  to  hold  the  conference  at  his  university.  The 
Army  Mathematics  Steering  Committee  (AMSC),  sponsor  of  these  conferences, 
would  like  to  thank  New  Mexico  State  University  for  providing  such 
excellent  facilities  for  this  meeting. 


Dr.  Duncan  asked  Ms.  Peggy  Hoffer,  of  the  WSMR  Plans  Office,  to  serve 
as  chairman  on  Local  Arrangements  for  this  conference;  and  Mr.  Robert 
Green,  of  the  Instrumentation  Directorate,  to  handle  any  technical 
problems  associated  with  the  program.  These  individuals,  together  with 
many  other  members  of  the  WSMR,  helped  make  this,  the  26th  Conference, 
a  very  successful  and  Interesting  meeting. 

The  Subcommittee  for  Probability  and  Statistics,  chaired  by  Dr.  Douglas 
B.  Tang,  Is  responsible  to  the  AMSC  for  conducting  these  Army  conferences. 
Dr.  Tang  asked  Dr.  Frank  E.  Grubbs  to  be  Chairman  of  the  Program  Committee 
for  the  1980  conference.  One  of  the  first  acts  of  this  committee  was  to 
select  "Data  Analysis"  as  the  theme  of  this  meeting.  This  was  a  wise 
choice  because  of  the  large  amount  of  analytical  and  statistical  work  in 
testing  and  modeling  performed  within  the  many  aqencies  located  on  the 
base  of  the  host  Installation.  At  the  first  meeting  of  the  Program 
Committee,  the  following  national  known  scientists  were  selected  as  the 
Invited  speakers  for  this  year's  conference. 


Speaker  and  Affiliation 

Professor  Francis  J.  Anscombe 
Yale  University 

Dr.  Toby  J.  Mitchell 

Union  Carbide  Nuclear  Division 

Professor  W.  J.  Conover 
Texas  Tech  University 


Professors  James  R.  Thompson, 
Chlh-Chy  Fwu,  and  Richard 
A.  Tapia 
Rice  University 

Professor  Victor  Solo 
Harvard  University 


Title  of  Address 

How  Far  to  go  in  Looking  at  Data 


Design  of  Experiments 


The  Rank  Transformation  as  a  Robust 
and  Powerful  Tool  for  the  Analysis 
of  Experimental  Data 

The  Nonparametric  Estimation  of 
Probability  Densities  in  Ballistic 
Research 


Engineering  Time  Series  Analysis 


Professor  Richard  A.  Johnson 
University  of  Wisconsin 


'"‘.ress-Strenij? h  Models  for  Pe'iaH'Hty 
Overview  and  Recent  Advances 


Professor  Badrig  Kurkjlan,  Professor  of  Statistics  at  the  University  of 
Alabama,  Is  at  the  present  time,  chairman  of  the  committee  to  select  the 
recipient  of  the  Samuel  S.  Wilks  Memorial  Medal.  On  19  June  1980  he 
advised  Dr.  Robert  Launer,  secretary  of  the  Design  of  Experiments  Conference, 
that  Dr.  W.  Allen  Wallis,  Chancellor  and  Professor  of  Statistics  and 
Economics  at  the  University  of  Rochester,  had  been  selected  as  the  1980 
Wilks  Medalist.  This  distinguished  scientist  richly  deserves  this  honor 
for  his  contributions  to  applied  statistics. 

On  20-21  October  1980,  just  preceding  the  start  of  the  DOE  conference,  a 
special  tutorial  on  Applied  Regression  Analysis  was  held.  This  tutorial 
was  designed  for  engineers,  scientists  and  statisticians  who  are  involved 
in  analyzing  least  squares  data,  the  associated  statistical  inferences 
and  model  building.  The  Instructor  for  this  Informative  course  was 
Professor  Norman  Draper,  Department  of  Statistics,  University  of  Wisconsin 
and  the  Mathematics  Research  Center. 

The  AMSC  has  asked  that  these  proceedings  be  distributed  Army-wide  to 
enable  those  who  could  not  attend  this  conference,  as  well  as  those  that 
were  present,  to  profit  from  some  of  the  scientific  ideas  presented  by 
the  speakers.  The  members  of  the  AMSC  would  like  to  take  this  occasion 
to  thank  all  the  speakers  for  their  interesting  presentations  and  also 
the  members  of  the  Program  Committee  for  their  many  contributions  to 
this  scientific  meeting. 


Program  Committee 


Carl  Bates 
George  E.  P.  Box 

Larry  Crow 
Walter  Foster 


Robert  Launer  (Secretary) 
Douglas  Tang  (Chairman, 

Prob.  A  Stat.  Subcommittee) 
Malcolm  Taylor 
Langhorne  Withers 
Frank  E.  Grubbs  (Program 
Committee  Chairman) 
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Dear  Conference  Participant: 


It  is  our  pleasure  to  have  you  attend  the  fwenty-Sixth  Conference  on  the 
Design  of  Experiments  in  Army  Research,  Development,  and  Testing.  We  hope 
this  conference  will  provide  new  and  relevant  in  format  ion  which  will  ht 
useful  to  you  in  vour  future  endeavors. 

During  your  stay  in  Las  Cruces  we  also  hope  von  will  tale  tfv-  jjrportut  ity 
to  visit  the  rest  of  the  New  Mexico  State  University  .  i.tpus  and  White 
Sands  Missile  (tangs*. 

New  Mexico  State  Univcrsi tv,  fomded  in  1SSH,  has  grown  into  a  major 
institution  of  higiier  education.  In  its  Jevelopnenr,  rhe  University  has 
preserved  many  of  the  traditions  of  its  land-grant  origin  while  moving 
toward  increased  eirphasis  of  the  hitiunities,  liberal  arts,  and  natural 
sciences.  Tlie  mission  of  New  'lexice  State  University  is  to  benefit 
society  through  education,  research,  and  public  service.  1o  Carry  out 
its  mission,  the  University  operates  the  Agricultural  Lspenner.tai 
Station,  the  Arts  and  Sciences  Research  Center,  the  Center  for  business 
Research  and  Service,  the  New  Mexico  I'nergv  Institute,  t.-.c  New  "W-xico 
Solar  Energy  Institute,  the  Engineering  Experiment  State'll,  the  Mountain 
Laboratory,  the  ihvsical  Science  laboratory,  ,uid  nunv  ither  educational, 
research,  and  service  centers. 

White  Sands  Missile  Range,  established  is  white  S..nds  Proving  .Ircund  on 
9  July  194S,  represents  the  hugest  laud  su  >s  under  ccntioi  f  *te  United 
States  Army  (over  two  mil'i.  ii  acres).  I'ogether  with  its  remote  laun-.n 
areas  in  Utah,  it  allows  for  missile  flights  at  ahotit  nil)  mile'.  Since 
its  est.lbl  LSlunent ,  the  Range  h..s  evolved  into  aue  of  ll.v  :»>st  ajon  test 
facilities  for  use  hv  the  -S'1,  rise  Ajvinced  Res*  arch  Pit  tects  Agency,  Amy, 
Navy,  Air  lorce.  National  ieran.mtics  and  Sp.ivi*  llninist  rut  ion.  iVpartrvnt 
of  lineiRV,  anil  others. 

We  hO|K*  v  si  will  have  an  intern  t  rig .  .niuv.ihie.  and  friitt.il  st.iv. 


A 

MAN  ,\.  .V  K. 

Ma  jor  Iavut.] 

Comn;intl i r.i*  llcni* ;‘-i ]  New  Mcxk**  'Mtr  •  ,  .cr  > ; t  .• 
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HOW  FAK  TO  GO  IN  TOOK  INC  AT  DATA* 


Y.  J  .  An s combe 
Yale  University 
New  Haven,  Connecticut 

All  analysis  of  statlsilc.il  data  involves  a  balancing  feat.  On  tin-  one 
hand,  we  do  not  examine  the  date  with  an  empty  mind,  there  are  specific 
questions  we  want  to  obtain  answers  to  (that's  why  we  assembled  the  data  in 
the  first  place),  we  have  preconceptions  about  the  data,  in  examining  the 
data  we  should  not  forget  what  we  were  looking  for.  But  on  the  other  hand, 
we  should  have  an  open  mind,  we  should  let  the  data  speak  for  themselves,  we 
should  not  just  assume  that  some  theoretical  model  is  appropriate  without 
checking.  How  far  should  we  go  in  responding  to  unexpected  features  of  the 
data,  how  far  should  wc  let  the  data  control  the  kinds  of  things  we  do;  or 
how  far  should  we  trust  our  preconceptions? 

Now  let  me  digress  from  that  theme  for  just  a  moment.  I  have  not  only 
been  invited  to  speak  at  this  conference,  1  have  been  asked  to  give  the  key¬ 
note  address.  1  think  that  a  keynote  address  also  involves  a  balancing  feat. 
On  the  one  hand,  a  keynote  address  should  be  inspirational,  or  if  not  quit-e 
inspirational  at  least  interesting,  or  if  not  quite  interesting  at  least 
fairly  intelligible.  On  the  other  hand,  it  is  not  one  of  the  regular  invited 
papers  or  other  real  business  of  the  conference.  It  should  not  try  to  steal 
their  thunder.  It  should  not  be  too  weighty  or  indi pest ible--it  should  be 
hors  d' oeuvre  rather  than  a  main  course.  To  perform  this  balancing  feat,  I 
shall  raise  some  questions  that  do  not  seem  to  be  talked  about  much,  but 
which  all  of  us  are  aware  of,  and  which  therefore  should  be  somewhat  inter¬ 
esting  to  consider.  In  the  interest  of  digestibility,  I  shall  mostly  refrain 

*  Prepared  in  connection  with  research  supported  by  the  Office  of  Naval 
Research  (contract  NQ0014-7  5-C-0563)  . 


Cron  answering  Che  questions — because  in  most  cases  I  have  no  idea  how  to 
answer  them.  But  at  the  end  of  the  talk  I  will  give  a  little  information 
about  one  of  these  questions  chat  is  possibly  not  well  known,  and  to  that 
extent  the  talk  will  have  a  little  content. 

Forecasting  time  series.  Earlier  this  year  at  a  meeting  I  heard  a  talk  on 
short-term  forecasting  of  business  time  series.  The  talk,  which  was  very 
well  delivered,  threw  out  a  challenge  rather  vividly,  and  I  have  thought 
about  it  quite  a  bit  since.  I'd  like  to  indicate  the  gist  of  the  talk.  (I 
have  not  seen  any  write-up  of  the  talk,  what  I'm  saying  may  not  adequately 
represent  the  speaker's  views,  and  so  I  will  refrain  from  mentioning  his 
name.)  The  situation  considered  was  this.  He  had  several  time  series  re¬ 
lating  to  a  business  company,  production,  sales,  various  things,  and  also 
public  economic  series;  I  think  they  were  all  quarterly  series,  and  they 
went  back  a  good  many  years.  The  object  was  to  forecast  some  of  the  series 
for  the  next  1  or  2  or  3  quarters  after  the  last  observed  value.  A  standard 
method  would  be  to  use  Box-Jenkins  technology--f it  a  parametric  class  of 
models  to  the  data  by  maximum  likelihood  and  use  the  fitted  model  to  make 
the  forecast.  The  parametric  class  is  quite  wide,  and  if  in  fact  it  is  wide 
enough  to  represent  reality  adequately  this  procedure  will  be  Just  about  the 
best  possible.  The  procedure  is  fully  describable,  or  programmable,  and 
therefore  can  be  implemented  completely  by  computer.  The  speaker  didn't  want 
to  do  that.  He  had  imbibed  the  spirit  of  John  Tukey's  Exploratory  Data 
Analysis .  (Tukey  does  not  discuss  forecasting  in  his  book,  and  I  do  not  know 
whether  what  the  Speaker  did  was  similar  to  what  Tukey  might  have  done.)  lie 
plotted  Ills  time  series  and  took  ;•  rood  look  at  them.  lie  noticed  that  round 
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about  1973-74  the  aeries  seemed  to  change  character.  Something  happened  to 
the  economy  (there  was  the  oil  crisis),  and  he  judged  that  subsequent  behavior 
of  the  series  was  unlike  the  preceding  behavior.  So  he  felt  that  only  data 
after  the  change  should  be  used  in  forecasting,  although  there  were  not  many 
readings,  and  he  proceeded  to  make  (I  think)  some  simple  extrapolations  by 
eye.  He  expressed  the  opinion  that  it  was  better  to  do  a  rough-and-ready  job 
with  just  a  few  relevant  reading*  than  a  fancy  job  with  a  lot  of  readings 
that  were  mostly  irrelevant.  If  indeed  there  was  a  big  enough  change  in  the 
functioning  of  the  economy  in  1973-74  to  make  preceding  data  uninformative 
about  later  behavior,  then  he  certainly  had  a  case.  How  can  you  tell?  It's 
a  question  about  whether  reality  is  better  described  in  terms  of  Box-Jenkins 
parametric  models,  or  better  understood  by  someone  who  makes  judgments  based 
on  plots  and  general  background  information,  judgments  that  cannot  (I  think) 
be  computer-programmed.  The  speaker  seemed  to  think  that  obviously  the 
second  was  the  case.  I  don't  think  it's  obvious  either  way.  Some  things 
could  be  done  to  investigate  the  matter,  but  nothing  very  easy  or  very 
quick. 

How  much  should  we  look  at  the  data?  Everyone  agrees  that  we  must  sometimes, 
to  some  extent,  look  at  the  data.  Suppose  we  entertain  some  probabilistic 
model,  or  more  modestly  some  way  of  thinking  about  a  phenomenon  and  possible 
observations.  If  this  model  or  way  of  thinking  is  not  vacuous,  there  are 
some  logically  possible  observations  that  conflict  with  it — otherwise  it 
tells  us  nothing.  Therefore  it  behooves  us  to  see  whether  the  observations 
are  consistent.  Particular  instances  of  .his  arc  ver/  we  1 i  known. 

So  we  must  look  at  the  observations  a  hit.  The  trouble  is,  it  i9  ca  ,y 
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to  be  puzzled  and  misled  if  we  look  at  chem  very  much.  A  sample  (from  a 
population  or  probability  distribution  or  stochastic  process)  has  many 
individual  features  that  do  not  reflect  its  source  and  would  not  persist 
if  the  sample  size  were  much  increased.  Given  adequate  computing  power, 
the  question  of  how  much  to  look,  how  far  to  go,  outstrips  available  sig¬ 
nificance  tests  or  other  critical  apparatus  that  might  aid  our  judgment. 

To  refrain  from  examining  the  data  because  we  do  not  know  how  to  evaluate 
what  we  see,  that  surely  is  foolish.  To  assume  without  evaluation  that 
everything  seen  is  important,  is  foolish  too. 

The  matter  is  brought  home  to  me  whenever  I  take  a  small  random  sample 
from  some  distribution  or  process.  Suppose  I  generate  50  observations  from 
N(0,1)  .  One  of  Che  readings  is  bigger  than  3  and  the  estimated  variance 
is  large.  I  take  another  sample  and  this  time  there  are  no  outliers,  the 
largest  reading  is  only  about  1.5  and  the  sample  looks  as  if  it  came  from 
a  uniform  distribution.  I  wonder  if  my  program  is  wrong,  so  I  go  over  the 
program  again  and  I  take  a  much  larger  sample  and  make  a  goodness-of-f 1 t  test, 
and  then  things  seem  O.K. 

I'd  like  to  show  what  happened  the  last  time  I  tried  a  simulation.  I 
was  wanting  to  illustrate  the  difference  between  a  stationary  random  sequence 
that  was  jointly  normally  distributed  (jointly  Caussi.m)  and  a  stationary 
random  sequence  with  the  same  autocorrelations  (same  moments  of  first  and 
second  orders)  and  marginal  normal  distribution  for  any  one  member  that  was 
not  jointly  normal  and  would  have  a  less  temporally  homogeneous  behavior. 

The  simplest  examples  would  be  Markov  sequences,  and  I  began  with  a  lag-1 

> 

serial  correlation  coefficient  equal  in  “  .  figure  1  shows  I  lie  first  60 
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members  of  a  jointly  normal  sequence;  and  Figure  2  shows  Che  first  60 

members  of  a  type  of  "jump"  process  which  behaves  in  stretches  like  a 

Jointly  normal  random  sequence  with  a  bit  higher  lag-1  serial  correlation 

3 

(actually  ,  but  every  now  and  then  there  is  a  break  and  the  next  reading 

is  independent  of  its  predecessors - sometimes  at  the  break  there  is  a  big 

jump.  So  the  two  sequences  should  look  similar  except  for  occasional  big 
jumps  in  Che  second  process.  Figure  2  looks  as  it  is  expected  to,  but  not 

Figure  1 - the  first  20  readings  move  around  a  lot  (with  one  very  big  Jump 

between  the  tenth  and  eleventh  reading),  then  the  later  readings  are  much 
less  mobile.  I  can't  help  thinking  that  the  speaker  I  heard  on  time-series 
forecasting  would  have  identified  a  change  in  the  economy  round  about 
reading  no.  20. 

I  was  so  disgusted  with  the  untypical  behavior  of  the  first  plot  that 
I  scrapped  them  both  and  tried  again,  this  time  with  a  higher  lag-1  serial 
correlation  (gj  which  I  thought  would  cramp  the  style  of  the  jointly  normal 
process  and  make  it  behave  better.  Both  plots  (Figures  3  and  4)  looked 
reasonably  "typical"  of  what  I  expected.  (But  note  the  apparent  change  in 
direction  of  the  jointly  normal  plot  around  reading  no.  33.) 

How  many  explanatory  variables  in  regression?  The  question  arises  in  dif¬ 
ferent  connections - sometimes  very  troublesome,  sometimes  easy  to  answer. 

The  easy  case  is  a  planned  Factorial  experiment  of  the  classic  Fisherian 
kind  designed  to  permit  estimation  of  the  main  effects  of  various  factors 
and  all  sorts  of  Interactions.  Often  what  happens  is  that  a  few  main  effects 
and  perhaps  a  few  interactions  are  large  and  interesting  and  need  to  be  duly 
reported  and  understood,  while  the  rest  arc  samll  and  for  most  purposes  can 


be  ignored---it  wasn't  known  in  advance  which  effects  would  be  large  and 
interesting,  which  small  and  ignorable.  Usually  the  effects  are  orthogonal; 
their  meaning  does  not  depend  on  what  other  effects  are  estimated.  Provided 
there  are  a  few  degrees  of  freedom  for  estimation  of  error,  and  provided  we 
don't  challenge  the  appropriateness  of  the  model  (structure)  being  fitted, 
there  is  an  easy  answer  to  the  question,  how  many  effects  to  estimate: 
estimate  them  all  and  then  ignore  any  that  aren't  interesting. 

The  proviso  that  the  appropriateness  of  the  model  being  fitted  isn't 
challenged  is  important.  I've  already  sai..  .hat  it  behooves  us  to  verify 
that  models  being  fitted  are  consistent  with  the  data.  Suppose  we  want  to 
check  whether  the  (hypothetical)  unexplained  "error"  term  in  the  model  seems 
to  be  something  like  i.i.d.  normal.  The  obvious  thing  to  do  is  calculate 
residuals  from  the  estimated  effects  or  regression  relation.  How  closely 
the  residuals  reflect  the  hypothetical  errors  in  the  model  depends  on  how 
many  effects  or  regression  coefficients  have  been  estimated.  If  many  have 

been  estimated,  there  is  a  central  limit  effect each  residual  is  an  average 

of  many  errors,  and  does  not  principally  reflect  one.  To  have  informative 
residuals,  small  effects  should  not  be  estimated  but  left  in  the  residuals. 

(The  talk  concluded  by  presenting  some  material  from  Appendix  2  of  the 
author's  forthcoming  book,  Computing  in  Statistical  Science  through  APL 
(Springer).  A  rule  for  deciding  how  many  effects  to  estimate,  due  to 
J.  W.  Tukey,  for  the  purpose  of  obtaining  informative  residuals,  was  described. 
Then  two  further  rules  were  described,  one  of  them  based  on  C.  1..  Hal  lows' s 
Cp  statistic,  designed  to  permit  good  prediction  of  unobserved  values  of  the 
dependent  variable.  It  was  pointed  out  that  these  two  purposes,  informative 
residuals  and  good  prediction,  though  at  first  glance  quite  different,  are 
really  closely  related,  and  the  three  rules  often  lead  to  the  same  result. 

<> 
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THE  U5C  OF  RIDGE  REGRESSION  ill  TRAJECTORY  ESTIMATION 


William  S.  Agee  and  Robert  H.  Turner 
Mathematical  Services  Branch 
Data  Sciences  Division 
US  Army  White  Sands  Missile  Range 
White  Sands  Missile  Range,  Hew  Mexico  83002 

ABSTRACT.  Combining  observations  from  several  different  trajectory 
measuring  instruments  we  want  to  estimate  the  cartesian  position  (velocity) 
coordinates  of  a  trajectory  at  possibly  a  large  number  of  tine  points.  Since 
the  measuring  systems  are  subject  to  systematic  measurement  errors  as  we  1 1  as 
random  measurement  errors,  in  addition  to  estimating  the  trajectory  coordi¬ 
nates  we  also  want  to  estimate  the  measurement  biases.  The  resulting  estima¬ 
tion  problem  is  a  combined  linear  and  nonlinear  estimation  problem  in  which  the 
trajectory  coordinates  appear  as  nonlinear  parameters  in  the  measurements  and 
the  biases  appear  as  linear  parameters  in  the  measure ments.  In  practice  we 
have  found  that  it  is  often  necessary  to  statistically  constrain  the  measure¬ 
ment  bias  estimates  by  the  use  of  Bayesion  priors.  These  priors  are  assumed 
to  be  normal  usually  with  mean  zero.  Thus,  the  specification  of  the  priors 
is  reduced  to  the  specification  of  a  prior  variance  for  each  measurement  bias. 
There  are  no  rules  for  choosing  these  prior  variances  and  use  of  these  guesses 
may  result  in  rather  poor  bias  and  trajectory  estimates  in  which  the  estimated 
bias  vectors  are  too  long  and  some  of  the  biases  may  be  of  the  wrong  sign.  We 
have  developed  the  use  cf  techniques  very  similar  to  ridge  regression  to  treat 
this  problem.  The  use  of  ridge  regression  for  this  problem  results  in  signifi¬ 
cant  improvements  in  both  the  trajectory  and  bias  estimates.  We  demonstrate 
the  use  of  this  technique  on  several  real  trajectory  estimation  problems  which 
have  arisen  at  WSNR.  In  these  problems  we  are  estimating  a  trajectory  and 
measurement  biases  using  measurements  from  several  radars.  In  some  of  these 
problems  we  also  have  measurements  from  optical  tracking  systems  which,  since 
they  are  more  accurate  and  precise  than  radar,  we  use  to  prove  the  value  of 
the  ridge  regression  technique  for  obtaining  improved  radar  trajectory  and  bias 
estimates.  In  using  this  ridge  regression  technique  we  have  been  successful 
in  choosing  a  good  value  of  the  ridge  parameter  by  using  the  ridge  trace  method 
proposed  by  Koerl  and  Kennard.  The  ridge  trace,  although  it  is  successful  in 
obtaining  a  good  value  of  the  ridge  parameter,  is  ur.sati sfactory  in  an  auto¬ 
matic  data  processing  procedure  such  as  trajectory  dota  reduction  since  a 
ridge  trace  requires  humjn  visual  interpretation.  We  have  tried  some  auto¬ 
matic  methods  tor  selection  of  a  good  ridge  parameter  value.  So  far,  we 
have  been  unsuccessful  in  these  attempts  to  develop  an  automatic  method  for 
choosing  a  good  ridge  parameter  and  we  would  like  to  have  this  considered  as 
a  clinical  paper  for  the  purpose  of  obtaining  some  new  ideas  for  developing 
such  a  method. 

1.  TRAJECTORY  ESTIMATION.  Measurements  of  range,  azimuth,  and  eleva¬ 
tion  from  several  "Jinerent  radars  are  used  to  estimate  the  cartesian  posi¬ 
tion  coordinates  of  a  vehicle  trajectory  at  a  sequence  of  times,  t, ,  1  =  1 ,  N 
which  cover  the  entire  trajectory.  Since  the  measurements  are  subject  to 
systematic  errors  as  well  as  random  measurement  errors,  we  also  want  to  esti¬ 
mate  l  lie  syste  matic  error  p.u’.i  .nters  or  biases  in  audition  to  the  trajectory 
coordinates.  The  resulting  estimation  problem  is  a  combined  linear  and  non¬ 
linear  estimation  problem  in  which  the  trajectory  coordinates  appear  as  non¬ 
linear  p a ■v.is'i "it.  in  the  measure  : c ; 1 1 r,  and  the  biasir  appear  as  linear  para- 
•iv t  in  li:c  i  va  uiv.  icnts  . 
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Let  (x  • )  be  a  measurement  function  where  x^  is  the  cartesian  position 
vector  to  the  trajectory  at  time  t-.  If  we  haye  M  different  radars  observing 
the  trajectory,  then  a  =  1,  3M.  For  a  range  measurement  from  the  p^  radar 


ha(xi)  =  +  {yrV* +  (zi'2p)]1/2 


0) 


where  (Xp,  yp,  Zp)  are  the  cartesian  coordinates  of  the 

origin  of  the  local  cartesian  coordinate  system  at  the  p^-  radar.  For  an  azi¬ 
muth  measurement  from  the  p^-  radar  the  measurement  function  is 


ha(x1)  =  tan 


-1  X1 


yi  •  yp 


For  an  elevation  measurement  from  the 


h  (x<)  =  tan" 
a  i 


[  (xrxp)  +  (yryp)  ] 


Let  za(t^)  denote  the  observed  value  of  the  a—  measurement.  The  obser  j- 
tions  are  modeled  as 

Za(ti}  *  ha(xi}  +  ba  +  ea(l) 

where  b  is  a  constant  measurement  bias  and  e  (i)  is  a  zero  mean,  random 
measurement  error.  Let  b  be  a  3M- dimensional  bias  vector  b  *  [bj  b2—  b  |. 
Then  the  measurement  model  can  be  represented  as 

za(ti)  =  ha(x1}  +  Sab  +  ea(1> 


where  sq  Is  a  row  vector  with  a  one  in  the  a —  entry  and  zeros  in  all  other 
entries. 

sa  •  [0  0  — 0  ’  0  —0] 

-4 

Let  R  (t4)  be  known  variances  of  the  random  measurement  errors,  e  (t,).  A 
o  1  a  1 

normal  prior  with  mean  zero  and  diagonal  covariance  matrix  P  will  be  assumed 

for  the  bias  vector  b,  P  =  diag  (PQj,  a  *  1,  3M.  The  estimation  problem  to 

be  considered  is  to  minimize, 
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with  respect  to  x.,  i  *  1,  N  and  b.  Differentiating  (7}  with  respect  to 
and  b  results  in  the  nonlinear  normal  equations 

1  VVRa ,Cti>(2a(ti)-ha(xt>-sab)  *  0  1  *  l.N  (8) 

a=l 

N  3M  t  .  . 

*  ?,  SaR' WWJ-W-W  -  p  b  *  °  (») 

i-l  a- I 

*  A 

where  x^-  is  the  estimate  of  and  b  is  the  estimate  of  b.  In  (8)  Ha(x^) 

3h  (tj 

is  the  derivative,  — — —  .  In  order  to  solve  the  normal  equations,  thev 

3X1 

are  linearized  about  a  guess  trajectory,  x^S-  Let  x.  0)f  1  *  1»  N  and  b^s) 
satisfy  (8) ,  i.e. , 

3M 

Z  HaT(xi(s))R;1(t1)(za(t1)-ha(x1(s))-Sab(s))  »  0  1  ■  1,  M  (10) 

G=  1 

If  (8)  is  linearized  about  XjO)  and  b^,  we  obtain 

^xi~xi^  ^  =  *Ai  Ai  ,N+1  OD 

where 

Ai  a  ^  HaT(x1(S))R;1(t1)Ha(xi(s))  (12) 

ar  ! 

A1,Ntl  '  J1S„TR;1‘tf)Ha^i<S)>  03) 

Aj  is  3  X  3  and  M4p1  is  3  X  3H.  Linearizing  the  second  normal  equation,  (9), 
about  x^v  '  and  solving  for  b  gives  the  result, 


(14) 


/„\  iN3Mtt  nT  -1 

b  ‘  (p  * „i,s«  p,i  ttt)Vtf,A1.Nt1A(  \.nu 


> 


I 


sA'“i>t2«<t()-h>/5>)-sab(si) 
1=10=1  , 

,  N  311  -r  1 

~(P "V  e  z  sjk'ctjs  )bts) 

1=1  o=l  °  a  1  ° 


(11)  and  (14)  for  x.  -  and  b  -  are  the  basic  equations  for  traject¬ 
ory  estimation.  Th£  solution  to  the  normal  equations  are  obtained  by  suc¬ 
cessive  relinearization  and  solution  of  (8)  and  (9). 

2.  APPLICATION  OF  RIDGE  DEGRESSION.  Although  there  are  no  convergence 
problems  in  solving  the  nVnnVf  equations  iteratively  for  the  N-station  radar 
case,  another  problem  which  is  fairly  common  in  the  solution  of  linear  least 
squares  problem  occurs  quite  frequently  in  trajectory  estimation.  Very 

often,  the  estimate  of  the  bias  vector,  b,  converges  to  a  solution  which 
several  of  the  components  are  too  large.  Sometimes  the  bias  solution  is 
obviously  erroneous.  One  obviously  erroneous  case  which  frequently  arises 
is  that  the  elevation  bias  components  will  all  be  large  and  of  the  same  sign. 
This  problem  of  the  estimated  bias  vector  being  too  long  is  usually  attri¬ 
buted  to  multicollinearity  among  the  predictor  variables  in  the  linear  least 
squares.  The  problem  in  the  linear  estimation  case  is  often  successfully 
treated  by  some  method  of  biased  estimation.  This  problem  has  not  been  prop¬ 
erly  recognized  or  successfully  treated  when  it  arises  in  trajectory  esti¬ 
mation.  Although  the  existence  of  these  erroneous  bias  estimates  have  been 
recognized  in  trajectory  estimation,  the  source  of  the  difficulty  was  not 
properly  identified.  Some  workers  in  trajectory  estimation  have  stated  that 
the  existence  of  the  problem  demonstrated  the  need  to  specify  a  prior  distri¬ 
bution  in  order  to  "tie  down"  or  statistically  constrain  the  bias  estimates. 
Hence,  the  reason  we  have  included  the  prior  in  (7).  It  does  not  take  much 
experience  In  using  these  priors  for  radar  trajectory  estimation  to  realize 
that  the  problem  of  inflated  bias  estimates  is  usually  as  much  present  with 
as  without  the  prior.  There  are  no  rules  for  choosing  a  good  prior.  We  at 
first  attempted  to  treat  this  problem  by  introducing  a  ridge  parameter  a. 
Instead  of  minimizing  (7),  we  would  minimize 


:  3t  ♦  (mibV’t. 


06) 


Minimization  of  (15)  merely  introduces  a  factor  of  (1  +  \)  in  (14)  wherever 
P"1  appears.  We  denote  the  ridge  solution  as  b(x).  The  ridge  solution  re¬ 
duces  to 


b(x)  -  b(o)  =  -ACQ+AP-W’bto) 
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(16) 


iMfMlljirai  i 


where 


07) 


\ 


Q  =  P' 


N 

+  I 
id  ad 


3M  t  .1  M  T  _i 

1  Sa  Ra  (ti)Sa  "  ^N.N+lN  Ai,N+l 


i 

i 
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r 

i 

p 

r 

s 

l 
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3  CHOOSING  THE  RIDGE  PARAMETER-THE  RIDGE  TRACE.  How  should  the  ridge 
parameter  x  be  chosen?  The  graphical  ridge  trace  method  proposed  by  Hoerl 
and  Kennard  [1]  is  a  standard  method  for  choosing  x.  Consider  the  following 
example  from  WSMR  data.  We  have  three  radars,  R122,  R123,  and  R395  tracking 
a  level  flying  target  flying  at  about  30000  ft.  The  graph  of  Figure  1  shows 
the  relative  geometry  of  the  target  trajectory  and  radars.  The  diagonal  ele¬ 
ments  of  the  prior  covariance  matrix  used  in  this  example  were  Pn  *  P^  s 

P77  =  1  (Range  bias  elements),  P22  =  P33  =  (^R  122),  P55  =  P66  =  (^R  121), 

and  P$e  «  P99  =  O^R  395),  where  R,  is  the  average  range  from  the  j—  radar 


to  the  trajectory.  Figure  2  is  a  ridge  trace  for  this  example.  Note  X  = 

-1  corresponds  to  the  least  squares  solution.  We  quickly  learn  two  things 
by  examination  of  this  ridge  trace.  One,  the  range  bias  estimates  do  not 
stabilize  from  which  we  conclude  that  a  ridge  parameter  should  not  be  used 
on  the  range  bias  elements.  Two,  we  would  also  want  to  conclude  from  the 
ridge  trace  that  there  is  no  benefit  to  be  derived  from  the  use  of  a  prior 
on  any  of  the  range  bias  terms.  Note  that  stability  of  the  estimates  occurs 
near  the  least  squares  solution  which  corresponds  to  x  =  -1.  The  Bayesian 
solution  with  the  prior  specified  which  corresponds  to  x  =  0  is  not  plotted 
on  the  graph.  This  solution  has  the  bias  estimates, 


R122 

R123 

R395 

Range  (ft) 

27 

23 

-20 

Azimuth  (mr) 

-.02 

-.06 

.12 

Elevation  (mr) 

.09 

-.06 

-.05 

These  estimates  are  not  in  the  stability  region  of  the  ridge  trace.  Figure 
3  is  a  ridge  trace  for  this  same  example  but  without  a  prior  on  the  range 
bias  terms.  The  range  biases  are  now  only  indirectly  affected,  i.e.,  through 
the  angle  bias  estimate,  by  the  ridge  parameter  x.  The  value  X  -  -.99  ap¬ 
pears  to  be  a  good  choice  of  the  ridge  parameter.  The  following  table  con¬ 
firms  that  large  errors  are  present  in  the  least  squares  bias  estimates  and 
that  the  ridge  estimates  with  x  =  -.99  provides  a  much  better  solution.  The 
optics  solution  is  derived  from  the  azimuth  and  elevation  measurements  from 
several  optical  tracking  cameras.  The  camera  measurements  are  inherently 
much  more  precise  than  the  radar  measurements;  hence,  we  often  use  an  opti¬ 
cally  derived  solution  as  a  standard  against  which  we  compare  radar  perform¬ 
ance. 
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R122-R 

R123-R 

R295-R 

R122-A  R123-A 

R395-A 

R 1 22- E 

R 1 23- E 

/£555j: 

Optics  157.3 

152.9 

80.3 

.05X10'3  .02X10° 

.09X10*3 

.11X10° 

-.08X11° 

-.09X10* 

Ridge  (X-.99)  151.8 

148.2 

72.0 

0  -.06X10*3 

.09X10*3 

.12X10° 

-  .09X10° 

-.07X10* 

Least  squares  118.3 

114.8 

63.7 

.116X10°. 058X10°. 

148X10'-. 

737X10° 

-.947X10°- 

-.538X10* 

Even  though  we  believe  that  the  ridge  solution  provides  much  better 
trajectory  and  bias  estimates  from  an  N-station  radar  solution  than  does 
the  least  squares  solution,  we  have  tried  and  are  still  trying  to  develop 
a  practical  method  of  using  ridge  regression  in  trajectory  data  reduction. 
The  graphical  ridge  trace  method  of  Hoerl  and  Kennard  for  choosing  x  is 
unsatisfactory  for  trajectory  data  reduction.  The  quick  turnaround  time 
required  by  range  users  for  trajectory  data  products  makes  the  ridge  trace 
method,  which  requires  human  intervention,  impractical  for  routine  use. 

An  automatic  method  for  choosing  a  good  value  of  X  is  required  in  order 
that  the  ridge  method  be  practical  for  trajectory  data  reduction. 


4.  CHOOSING  THE  RIDGE  PARAMETER-MINIMIZING  THE  MEAN  SQUARE  ERROR. 

One  automatic  method  for  choosing  ridge  parameters  is  an  iterative  method 
proposed  by  Hoerl  and  Kennard  [2J.  This  method  is  developed  for  choosing 
the  ridge  parameter  vector  in  a  generalized  ridge  regression  problem.  For 
the  generalized  ridge  regression  in  the  radar  bias  estimation  application, 
we  want  to  choose  a  parameter  vector  x  having  components  x^,  i  =  1,  2M. 

Thus,  we  will  have  a  separate  ridge  parameter  for  each  of  the  angle  biases 
but  none  for  the  range  bias  terms.  Suppose  we  arrange  the  order  of  the 
biases  so  that  the  angle  biases  appear  in  the  first  2M  positions  of  b  and 
the  range  biases  io  the  last  M  positions  of  b.  We  partition  the  bias  esti 


mation  equation,  Qb  =  U  as 


Q  Rt 

b 

u 

a 

a 

= 

a 

R  Q„ 

b 

U 

r 

_ 

La 

r 

(18) 


In  the  above  we  have  replaced 
different  definition, 


the  matrix  Q  defined  in  (17)  by  the  slightly 


N  3M 


n  T  -1  N  T 

Q  =  r  i  s'r  '(t.)S  -  e  AT  A 

i=l  o=l  a  3  *  a  i=i  ' ,N+lfti ,N+ 1 


(19) 


Note  that  we  have  abandoned  the  use  of  priors  in  the  definition  of  Q. 

Suppose  we  transform  the  bias  vector  bfl  as  bfl  =  Te  where  T  is  orthogonal. 
We  can  eliminate  br  from  (18)  obtaining. 
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(20) 


br  =  Q;1(Uf-RTe) 
and 

tt(q  -rVr)tb  *  tTmVu  (21) 

'  a  r  a  r  i 

we  choose  T  so  that 

TT(Qa-RTQ^R)T  =  r  (22) 

where  r  =  diag  (y,)-  Then  we  have 

re  «  TT(Ua-RTQ'1Ur)  (23) 

We  now  form  the  generalized  ridge  regression  as 


(r+D(x))(e(A)-6(o))  *  -0(x)p(o),  (24) 

where  D(a)  =  diag  (x,.)  and  b(o)  is  the  least  squares  solution. 

The  iterative  method  of  Hoerl  and  Kennard  attempts  to  choose  the  X.  to  mini- 

a  • 

mize  the  mean  square  error  in  b(x).  Thus,  we  want  to  minimize 

E[e(A)-0)T(g(A)-e)]  (25) 

where  B  is  the  true  parameter  value.  Minimizing  (25)  results  in  the  choice  of 
the  optimal  A  as 


(26) 


Since  the  true  values,  6^,  are  unknown,  (26)  cannot  be  implemented  exactly. 
Hoerl  and  Kennards  approximate  implementation  of  (25)  uses  the  iteration. 


.I-1*  =  (l/g(K)  )2  (27) 

where  B^  is  obtained  from  (24)  as 

ejK)*  Yiei(o)/(Yi+x|n);  (28) 

with  b\0)  =  B1  (o). 

Hemmerle  [3]  has  shown  that  a  closed  form  solution  of  this  iterative  scheme 
is  easily  obtained.  Hemmerle  shows  that  the  iteration  converges  to  e*  for 

0  <  (1/y^2(o))  -  1/4  where 
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(29) 


(l-2e  )  -  A^T 

e* =  zr 

0 


-2  «  (i/\ 

and  eQ  =  1/^B  ^(o).  For  eQ  >  1/4,  b^  '  — ►  0. 

For  our  three  radar  example  applications,  we  obtain  the  following  data 
for  the  application  of  the  above  method. 


ba(°) 

^(o) 

e* 

. 1 1 6X10" 3 

-.43181  17 4X1  O' 4 

_ _ 

4.1857985X1010 

78.04 

-.42616219X10"4 

-.737X10' 3 

-.54950324X10'4 

5. 7866501 XI 01 0 

174.73 

-. 5463401 1X10'4 

. 058X1 0*3 

. 14775931X10'3 

5.8775591X1010 

1283.41 

.14554624X10" 3 

-. 947X10’3 

- . 1 2048667X1 0-3 

8. 3670572X1 0* 

121.46 

-.11947337X1 0_  3 

. 1 48X1 0~  3 

-. 17659027X10'3 

8. 31 56874X1 07 

2.59 

0 

-.538X1 O' 3 

-.13022000X1  O' 2 

1 . 2606770X1 07 

21.37 

-.1237882X1  O'2 

where  6*  is 

the  limit  of  the 

iterative  scheme. 

This  method  is  clearly  inadequate  for  dealing  with  this  example,  since  the 
bias  vector  has  been  shrunken  only  a  very  small  amount  whereas  a  considerable 
amount  of  shrinking  is  necessary  in  order  that  the  estimated  bias  agree  reason¬ 
ably  well  with  the  biases  calculated  from  the  optics  solution.  The  diffi¬ 
culty  arises  from  those  least  squares  bias  estimates  which  are  far  from  their 
true  values.  When  the  true  values  are  replaced  by  these  estimated  values  in 
the  Hoerl  and  Kennard  method,  the  iteration  converges  to  the  wrong  values 
which  are  quite  near  to  the  least  squares  estimates.  We  have  not  been  able  to 
find  an  automatic  method  for  choosing  ridge  parameters  which  works  well  for  any 
of  our  examples. 

5.  APPLICATION  OF  PRINCIPAL  COMPONENTS  REGRESSION.  In  absence  of  a  method 
for  choosing  ridge  parameters  we  have  recently  begun  to  work  wi th  principal  com¬ 
ponent  regression  as  an  alternative  to  the  use  of  ridge  regression  for  shrink¬ 
ing  the  erroneous  least  squares  solution.  We  have  had  a  fair  amount  of  success 
with  this  method  on  several  examples.  In  the  principal  components  regression  we 
set  =  0  for  i  =  2M-r,  2M  where  r  is  the  smallest  integer  for  which 


We  are  assuming  in  (30)  that  the  eigenvalues,  ^ ,  haye  been  ordered  from 

largest  to  smallest.  The'use  of  10*2  as  a  cutoff  for  the  eignvalues  in  (30) 
is  arbitrary,  but  we  have  found  it  to  work  well  in  several  examples.  For  the 
example  application  described  above,  the  principal  component  estimator  sets 
p5  =  g6  =  0,  corresponding  to  the  two  smallest  eignvalues.  The  results  of 

the  principal  components  estimator  for  this  example  is  compared  below  with 
optics,  least  squares,  and  ridge  regression. 


R\2jSR 

R123-R 

R395-R 

R122-A 

R12J-.A 

R?95„-A, 

R122-.E  R123lL 

R395-E 

Optics 

i  57. 3 

152.9 

80.3 

.OSXIO*3 

,02Xi0  3 

■09X10*3 

.nxio'3  -.oexio*3 

- .  09X10*' 

Ridge  (>‘-.9S) 

00 

•l> 

143.9 

72.0 

0 

-.06X1  O' 3 

.05X1  O'3 

. 12X10* 3  -.0SX10*3 

-.07X10*  3 

Least  squcrc-s 

ns. 3 

114.8 

63.7 

•  116X10'3 

•058X10*3 

.148X10'3 

-.  737X10' 3  -  .9-47X1 0*3 

-. 638X1 C‘3 

Princi pal 
components 

147.9 

144. 3 

69.4 

0 

-.055X10'3 

•1C5X10*3 

.123X10'3  -.08SX10*3 

-.067X1C'3 

The  principal  component  method  has  been  criticized  as  being  too  restrict¬ 
ive.  Marquardt  has  suggested  that  a  fractional  rank  procedure  be  used  instead 
of  the  principal  components  integral  rank  procedure.  The  fractional  rank  esti¬ 
mator  takes  a  linear  combination  of  principal  components  estimators, 

6f  *  CBr  +  (1-c)^.  fl-c-1  (31) 

where  6p  is  the  principal  components  estimator  of  rank  r.  The  difficulty  with 

the  fractional  rank  procedure  is  in  the  choice  of  the  parameter,  c.  Marquardt 
[4]  suggests  using  a  graphical  method  like  a  ridge  trace  where  each  component 

of  the  vector  Bf  is  plotted  against  the  fractional  rank,  f  =  cr  +  (l-c)(r+l). 

Another  procedure,  suggested  by  Hocking,  Speed,  and  Lynn  [5],  for  choosing  c 
is  to  minimize  the  mean  square  estimation  error.  This  method  is  implemented 
in  a  way  similar  to  the  iterative  method  of  Hoerl  and  Kennard  for  choosing 
the  generalized  ridge  parameters  and  suffers  the  same  difficulties  in  appli¬ 
cation. 

6.  CONCLUSIONS.  It  has  been  demonstrated  by  simulation  studies  [6] 
that  ridge  regression  offers  potentially  a  better  estimator  than  the  princi¬ 
pal  component  technique,  but  that  a  better  estimator  of  the  ridge  parameter 
is  necessary  before  that  potential  can  be  realized.  I  definitely  believe 
that  this  conclusion  is  correct  for  our  trajectory  estimation  applications. 

I  also  feel  that  a  similar  conclusion  could  be  demonstrated  for  the  fract¬ 
ional  rank  procedure.  We  are  unable  to  implement  these  potentially  better 
methods  in  a  routine  trajectory  data  reduction  because  we  are  presently 
unable  to  develop  a  good  estimator  for  the  parameters  in  either  method. 

We  are  greatly  in  need  of  sane  fresh  ideas  for  choosing  the  parameters  in 
these  biased  estimation  methods. 
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AN  EIGHT  VARIABLE  COMPOSITE  DESIGN  FOR 
FITTING  A  SECOND  ORDER  RESPONSE  SURFACE 
Carl  B.  Bates 

US  Army  Concepts  Analysis  Agency 
Bethesda,  Maryland  20014 

ABSTRACT.  An  electronic  warfare  study  has  been  initiated  at  the 
Concepts  Analysis  Agency.  The  study  is  to  provide  justification  for 
procurement  and  force  structuring  decisions  concerning  new  electronic 
warfare/signal  intelligence  (EW/SIGINT)  systems.  A  communications 
model,  Communications  Electronics  Warfare  Combat  Simulation  Model 
(COMMEL  1 1.5),  will  be  used  to  investigate  the  effectiveness  of  US  Ew / 
SIGINT  systems  against  enemy  command,  control,  and  communications  (C3) 
systems.  COMMEL  11.5  is  estimated  to  require  four  hours*  running  time 
to  simulate  an  eight-hour  battle.  Eight  model  input  variables  having  6, 
6,  4,  4,  4,  3,  2,  and  2  levels,  respecci vely,  were  selected  for  the  in¬ 
vestigation.  The  objective  is  to  fit  a  second  order  response  surface 
using  as  small  a  number  of  computer  runs  as  possible.  A  1/4  x  23  frac¬ 
tional  factorial  design  is  augmented  with  the  addition  of  axial  points. 
The  resulting  variation  of  a  central  composite  design  contains  80  design 
points.  The  experimental  design  is  presented  and  discussed. 

1.  INTRODUCTION.  The  Concepts  Analysis  Agency  has  been  tasked  to 
analyze  the  relative  contribution  of  US  electronic  warfare  systems  to 
the  outcome  of  ground  combat.  A  wide  variety  of  new  electronic  warfare/ 
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signal  intelligence  (EW/SIGINT)  and  weapon  systems  has  been  introduced 
for  the  post-1980  timeframe.  Recent  assessments  of  US  and  Soviet  com¬ 
mand,  control,  and  communications  (C^)  show  the  need  for  improving  the 
quantified  analysis  of  EW  assets  to  counter  threat  C^.  An  analytical 
basis  is  needed  to  provide  justification  for  procurement  and  force 
structuring  decisions  with  respect  to  EW/SIGINT  systems.  The  analysis 
should  provide  detail  sufficient  to  assess  the  potential  of  selected 
electronic  countermeasures  (ECM),  electronic  warfare  support  measures 
(ESM),  and  tactical  S1GINT  systems. 

To  accomplish  the  task,  the  Concepts  Analysis  Agency  initiated  the 
Force  Electronic  Warfare/Tactical  SIGINT  (FEWTS)  Study.  The  purpose  of 
the  study  is  to  analyze  the  relative  contribution  to  combat  potential  of 
the  denial,  destruction,  and  exploitation  of  threat  through  the 
application  of  US  £W/tactical  SIGINT  means. 

2.  PROBLEM  DESCRIPTION.  An  enhanced  version  of  the  Communications 
Electronic  Warfare  Combat  Simulation  Model  (COMMEL  1 1.5)  was  selected 
for  use  in  the  study.  COMMEL  11.8  is  a  fully  computerized,  dynamic, 
two-sided,  division  level  ground  combat  model.  It  will  be  used  to  in¬ 
vestigate  the  effectiveness  of  US  EW/SIGINT  systems  ayainst  enemy  C^ 
systems.  The  model  permits  detailed  observation  of  communication  events 
in  a  combat  environment,  and  provides  a  tool  for  measuring.  In  terms  of 
combat  outcome,  the  merit  of  selected  EW/tactical  SIGINT  capabilities 
against  a  threat. 


tight  model  input  variables  were  selected  for  the  investigation. 

The  variables  represent  equipments  and  operating  characteristics.  The 
number  of  levels  selected  for  the  eight  variables  ranged  from  two  to 
six.  The  eight  variables  and  their  levels  are  given  in  Table  1.  The 
full  6^  x  43  x  3  x  2^  design  has  over  27,000  variable  level  combinations. 

Table  1.  Design  Variables 


Variable 


Levels 


-  TACJAM 
x2  -  TLQ  17/A 
x3  -  QUICK  FIX 
x4  -  TRLBLZR 
x5  -  CR1T  NODES 
x6  -  JAM  vs  L/K 
x7  -  E/W  EMPL  CON 
x8  -  ARTLY  EMPL  CON 


0,  1,  2,  3,  6,  9 
0,  1,  2,  3,  6,  9 
0,  1,  3,  6 
0,  1,  2,  3 
1.  2,  3,  4 
1.  2,  3 
1,  2 
1,  2 


Nine  tentative  measures  of  effectiveness  (MOE)  were  identified. 

They  consisted  of  Red  and  Blue  materiel  and  personnel  losses  and  forward 
edge  of  the  battle  area  (FEBA)  loss.  All  nine  MOE  o'*?  continuous  vari¬ 
ables.  The  study  members  desired  a  second  order  response  surface  for 
each  of  the  nine  MOE  in  terms  of  the  eight  mode’  input  variables.  The 
second  order  model , 


y  a#0  +  01*1  +  02*2  +  03*3  +  04*4  +^5X5  +  06*6  +^7X7  +^8X8 
+  011*1  ♦  022*2  +  033*3  +  044*4  +  055*5  +  066*6 
+  012*1*2  + 013*1*3  + 014*1*4  +  + 017*1*7  ♦ 018*1*8 

+  023*2X3  +  024*2*4  +  •••  ♦  027*2*7  +028*2*8 

♦067*6*7  +  068*6*8 
♦078*7*8* 

has  43  terms  (8  linear,  6  quadratic,  28  cross-products,  and  the  inter¬ 
cept  term).  The  eight  independent  variables  (x's)  represent  the  eight 
COMMEL  1 1.5  model  input  variables  and  the  dependent  variable  (y)  denotes 
a  particular  COMMEL  11.5  model  output  variable,  MOE. 

3.  BACKGROUND .  Box  and  Wilson  (1951)  introduced  the  concept  of 
composite  designs;  Box  and  Hunter  (1957)  introduced  the  concept  of  ro- 
tatability;  and  Box  and  Uraper  (1959)  developed  criteria  for  selecting 
response  surface  designs.  Hill  and  Hunter  (1965)  and  Mead  and  Pike 
(1975)  give  reviews  of  the  developments  of  response  surface  methodology. 
More  readily  available  sources  on  response  surface  methodology  can  be 
found  in  Cochran  and  Cox  (1957),  Davies  (1960),  Myers  (1971),  and  Ander¬ 
son  and  McLean  (1974). 

Composite  designs  are  full  or  fractional  21  factorial  designs  aug¬ 
mented  with  additional  points  which  permit  estimation  of  the  quadratic 
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coefficients  of  a  second  order  surface.  The  augmentation  consists  of  2k 
plus  one  (or  more)  center  points  as  Illustrated  in  Table  2.  Therefore, 
the  composite  design  consists  of  (2k  ♦  2k  *  U  design  points. 


Table  2.  2k  +  1  Augmentation 


X1 

x2 

x3 

•  •  • 

xk 

-a 

0 

0 

•  •  • 

0 

+<* 

0 

0 

•  •  • 

0 

0 

-a 

0 

•  •  • 

0 

0 

■Hi 

0 

•  •  • 

0 

0 

0 

-a 

•  •  • 

0 

0 

0 

•  *  • 

0 

0 

0 

0 

•  •  • 

-<l 

0 

0 

0 

•  •  • 

•Hi 

0 

0 

0 

•  •  • 

0 

If  k«3  and  the  23  coded  x-values  are  +1  and  -1 ;  t  '.e  design  matrix 
for  a  central  composite  design  is  as  shown  in  Table  3  and  illustrated  in 

Figure  1. 
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Table  3.  Three  Variable  Central  Composite  Design  Matrix 


X1 

x2 

x3 

-1 

-1 

-i  \ 

-1 

-1 

1  ) 

-1 

1 

-1  f 

-1 

1 

1  \  2*  factorial 

1 

-1 

-1  ( 

1 

-1 

1  \ 

1 

1 

-1  1 

1 

1 

1  / 

-<» 

0 

0  \ 

+ <1 

0 

0  / 

0 

-a 

0  (  2x3  axial  points 

0 

♦  /» 

0  ( 

0 

0 

-a  \ 

0 

0 

•f  a  I 

0 

0 

0  ■*-  -  —  center  point 

The  literature  on  central  composite  designs  contains  discussions  on 
the  determination  of  a  to  yield  orthogonal  designs.  However,  no  infor¬ 
mation  was  found  applicable  to  the  above  described  problem  in  which  the 
x-values  are  prescribed  and  fixed.  The  following  section  discusses  the 
attempts  to  develop  the  eight  variable  composite  design. 

4.  CANDIDATE  DESIGNS 

a.  Design  A.  A  "Base  Case"  situation  was  defined  early  in  the 
study  planning  phase.  The  Base  Case  combination  of  the  levels  of  the 
eight  variables  is  shown  by  the  circled  values  in  Table  4. 


Table  4.  8ase  Case 


X1 

x2 

x3 

x4 

x5 

x6 

x7 

x8 

0 

0 

0 

0 

1 

1 

(D 

CD 

1 

1 

1 

0) 

(D 

© 

2 

2 

2 

2 

(D 

2 

3 

3 

(D 

G> 

6 

3 

4 

6 

6 

9 

9 
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First,  a  fractional  factorial  design  was  developed.  The  lowest  and 
the  highest  values  of  each  of  the  eight  variables  were  considered  as  the 
"low"  and  "high"  values  of  a  2®  factorial.  A  resolution  V  2®"^  frac¬ 
tional  factorial  design  was  developed  using  I  =  ABCE6  ■  ABDFH  =  CDEFGH 
as  the  defining  contrast.  This  gave  64  design  points.  The  resolution  V 
design  permits  fitting  the  8  linear  terms  and  the  28  cross-product 
terms.  Table  5  repeats  the  Base  Case  values  given  in  Table  4  and  illus¬ 
trates  the  high  and  low  values  used  in  the  fractional  factorial.  Aug¬ 
mentation  of  the  fractional  factorial  then  consisted  of  using  "inside" 
values  as  the  axial  points.  This  gave  the  16  design  points  shown  in  the 
lower  portion  of  Table  5.  Note  that  for  x^,  xg,  x3,  and  x6,  Base  Case 
values  were  treated  as  center  points,  but  x4  and  x&  were  balanced  over 
the  two  inside  values,  as  were  x^  and  xg.  This  gave  a  total  of  80  de¬ 
sign  points.  The  composite  design  matrix  A,  however,  was  singular. 
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b.  Design  B.  For  the  second  attempt  to  develop  a  composite  de¬ 
sign,  the  same  fractional  factorial  design  was  retained,  but  the  16  aug¬ 
mentation  points  were  changed.  The  inside  values  for  all  variables  ex¬ 
cept  Xg  were  balanced  over  the  16  design  points.  Variable  x§  was  set  at 
its  Base  Case  value  (center  point).  The  augmentation  part  of  the  design 
is  shown  in  the  lower  portion  of  Table  6.  This  composite  design  matrix 
was  also  singular. 

c.  Design  C.  The  third  attempt  involved  the  same  fractional  fac¬ 
torial  design,  but  the  high  and  low  values  were  changed.  This  time  two 
adjacent  inside  values  were  used  as  the  high  and  low  values.  One  of  the 
adjacent  values  was  the  Base  Case  value  (center  point).  The  high  and 
low  values  used  are  shown  in  the  center  portion  of  Table  7.  For  vari¬ 
able  x6,  which  has  only  one  inside  value,  one  outside  value  (3)  was 
used. 

For  the  augmentation  portion  of  the  composite  design,  outside  val¬ 
ues  were  used  as  the  axial  points  for  x1  through  xg.  The  next  two  in¬ 
side  values  (1  and  6)  were  used  as  another  pair  of  axial  points  for 
variables  x^  and  x^  (points  13  and  14,  and  points  15  and  lb).  All  other 
variables  were  held  fixed  at  their  center  points.  Variables  x^  and  xy 
were  not  varied,  all  lb  points  were  hold  fixed  at  their  center  points. 


Table  7.  Design  C 


X1 

x2 

x3 

x4 

x5 

x6 

x7 

x8 

0 

0 

0 

0 

1 

1 

© 

© 

1 

1 

1 

CD 

© 

© 

2 

2 

2 

2 

© 

2 

3 

3 

<D 

© 

6 

3 

4 

6 

6 

9 

9 

fractional  Factorial 

Low 

2 

2 

1 

1 

2 

2 

1 

1 

High 

3 

3 

3 

2 

3 

3 

2 

2 

Augmentation 

1 

3 

3 

1 

2 

2 

1 

1 

2 

0 

3 

3 

1 

2 

2 

1 

l 

3  . 

3 

(81 

3 

1 

2 

2 

1 

l 

4 

3 

0 

3 

1 

2 

2 

1 

l 

5 

3 

3 

P| 

1 

2 

2 

1 

1 

6 

3 

3 

1 

2 

2 

1 

1 

7 

3 

3 

3 

|8| 

2 

2 

1 

1 

8 

3 

3 

3 

2 

2 

1 

1 

9 

3 

3 

3 

1 

01 

2 

1 

1 

10 

3 

3 

3 

1 

0 

2 

1 

1 

11 

3 

3 

3 

1 

2 

(lj 

1 

1 

12 

3 

3 

3 

1 

2 

1 

1 

13 

[i| 

3 

3 

1 

2 

2 

1 

1 

14 

0 

3 

3 

1 

2 

2 

1 

1 

15 

3 

(T| 

3 

1 

2 

2 

1 

1 

16 

0 

3 

1 

2 

2 

1 

1 

Design  matrix  C  was  nonsingular;  its  rank  was  43.  Consequently, 
the  full  43-term  second  order  response  surface  model  given  above  in  the 
second  section  can  be  fitted.  To  date,  other  nonsingular  design  mat¬ 
rices  have  not  been  developed.  Therefore,  the  design  matrix  C  has  not 
been  compared  against  other  nonsingular  design  matrices.  The  determi¬ 
nant  of  (C'C)  was  evaluated,  however,  and  |C'C|  *  (3.94)  x  10^.  Also, 
the  variances  of  the  43  regression  coefficients  were  obtained  and  are 
tablualted  in  Table  8.  The  variance  of  bg  is  125.2.  The  variances  of 
the  regression  coefficients  of  the  8  linear  terms  range  from  1.9  to 
16.4;  the  variances  of  the  regression  coefficients  of  the  6  squared 
terms  range  from  0.002  to  0.456.  The  variances  of  the  regression  coef¬ 
ficients  of  the  cross-product  terms  range  from  0.051  to  0.512. 
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Table  8.  Variances  of  Regression  Coefficients 


Regression 

Coefficient 

Variance 

Regression 

Coefficient 

Variance 

Regression 

Coefficient 

Variance 

bo 

125.200 

b12 

0.194 

b35 

o.053 

bl 

5.172 

b13 

0.512 

b36 

0.057 

b2 

5.172 

b14 

0.212 

b37 

0.057 

b3 

1.900 

b15 

0.212 

b38 

0.057 

b4 

10.350 

b16 

0.215 

b45 

0.218 

b5 

9.855 

bl7 

.  0.220 

b46 

0.230 

b6 

16.400 

b18 

0.220 

b47 

0.234 

b7 

6.838 

b23 

0.051 

b48 

^0.234- 

b8 

6.838 

b24 

0.220 

b56 

0.220 

bU 

0.002 

b25 

0.202 

b57 

0.225 

b22 

0.002 

b26 

0.215 

b58 

0.225 

b33 

0.008 

b27 

0.220 

b67 

0-239 

b44 

0.382 

b28 

0.220 

b68 

0.239 

b55 

0.173 

b34 

0.023 

lj/8 

0.239 

b6b 

0.456 

i 

i 


i 


1 
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5.  CONCLUSIONS.  The  procedure  employed  above  can  be  applied  to 
develop  second  order  response  surface  desi9ns  for  situations  in  which 
the  variable  levels  are  prescribed  and  fixed.  However,  a  systematic 
method  for  development  of  the  composite  design  is  needed.  The  designs 
attempted  suggest  that  Inside  variable  values  should  be  used  in  the  fac¬ 
torial  or  fi actional  factorial  portion  of  the  composite  design  and  that 
the  axial  points  should  lie  on  or  outside  the  k-diinensional  cube  of  the 


factorial . 
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AN  APPLICATION  OF  ORDER  STATISTICS  TO  TIME-SEQUENCE  LOGIC 


Wi Ilian  E.  Baker  and  Malcolm  S.  Taylor 
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US  Army  Ballistic  Re«earch  Laboratory 
Aberdeen  Proving  Jro-'i  .  Maryland 


ABSTRACT.  If  Xj,  X2,  • Xn  are  independent,  identically-distributed 

random  variables,  then  Yj  <  Y2  <  • . *  <  Yn,  where  the  Y^'s  are  the  X^'s 

rearranged  in  order  of  increasing  magnitudes,  are  defined  to  be  the 
order  statistics  corresponding  to  the  original  random  sample.  Order 
statistics  have  been  applied  to  the  solution  of  a  problem  involving  the 
determination  of  time  windows  for  firing  impulses  in  a  fuzing  system.  A 
computer  program  has  been  written  which  provides  the  probability  of  a 
warhead  fuzing  as  a  function  of  the  parameters  which  characterize  the 
detonators.  Conversely,  given  a  required  probability  of  fuzing,  the 
program  will  determine  the  necessary  detonator  characteristics.  Although 
motivated  by  this  specific  problem,  the  work  is  general  in  nature  and 
should  have  additional  applications  in  the  armament  research  and  develop¬ 
ment  community. 

1.  INTRODUCTION.  Let  Xj,  X2,  ...,  Xn  be  independent,  identically- 

distributed  random  variables.  Then  Y.  <  Y-  <  ...  <  Y  ,  where  the  Y, 's 

i  *  n  i 

are  the  X^'s  rearranged  in  order  of  increasing  magnitudes,  are  defined 
to  be  theorder  statistics  corresponding  to  the  original  random  sample. 

Order  statistics  find  immediate  application  in  the  design  and  evaluation 
of  logical  structures  which  make  decisions  based  on  the  relative  values 
assumed  by  a  set  of  n  random  variables.  One  particularly  interesting 
application  involves  the  determination  of  time  windows  for  firing  impulses 
in  a  fuzing  system.  This  is  the  problem  which  motivated  the  work  on 
which  we  are  reporting.  However,  the  work  is  general  in  nature  and  may 
prove  useful  for  other  applications  in  the  armament  research  and  development 
community. 


2,  STATEMENT  OF  THE  PROBLEM.  In  the  particular  problem  which  we 
addressed,  a  fuze  contains  N  detonators,  K  of  which  must  function  within 
a  specific  time  span.  Furthermore,  the  second  detonator  (which  functions 
at  time  Y2)  partitions  the  time  span  into  two  subintervals.  The  first 

subinterval  (Y2"  6j,  Y2]  is  examined  to  determine  if  the  first  detonator 

functioned  within  that  time  segment,  and  the  second  subinterval  (Y2, 

♦  62]  is  monitored  to  count  the  number  of  additional  detonators  activated 


during  that  period  of  time. 

K  detonators  have  functioned, 


If,  within  the  time  interval  [Y2  -  6j,  Y2  ♦  $ 
then  the  command  to  fire  will  be  initiated; 


otherwise,  it  will  not.  The  times  to  function  for  the  detonators  are 
random  variables  and,  as  such,  can  be  characterized  by  a  cumulative 
distribution  function  F.  Assuming  that  the  time  to  function  of  each 
detonator  is  identically  distributed,  then  the  problem  consists  of 
expressing  the  probability  of  fuzing  as  a  function  of  K,  N,  6j,  and  F* 

3,  SOLUTION.  Let  X^  be  the  time  to  function  of  detonator  i  in 
its  operating  environment.  Then  Xj,  X2,  — ,  XN  are  independent,  identically- 
distributed  random  variables;  and  we  can  define  Yj,  Y^,  . ...  YN  to  be 
the  order  statistics  corresponding  to  the  X^'s.  For  our  problem,  if 
Y2  -  Yj  <  we  are  interested  in  the  probability  that  Y^  -  Yj  <  d2;  however, 

if  ^2  *  Tj  >  6j,  we  need  to  determine  the  probability  that  Y^+j  -  Y2  <  6., 

assuming  K  ♦  1  <  N.  Therefore,  we  need  to  evaluate 

Pr  (warhead  fuzing}  =  Pr  {Y2  -  Yj  <  Pr  {Yj,  -  Y2  <  62  I  Y2  *  Y1  <  ^ 

♦  Pr  (Y2  -  Yt  >  6j }  Pr  {Yk+1  -  Y2  <  62  I  Y2  *  Yi  >  V  •  CD 

Applying  the  definition  of  conditional  probability  we  obtain 

Pr  {warhead  fuzing}  =  Pr  {Y2  -  Yj  <  dj)  Pr  {Y^  -  <  d2  and  Y2  *  Y1  ^  ®j} 

Pr  {Y2  -  Yx  <  6j} 

*  f  (v2  ■  h  >  V  Pr  <¥>C.l  -  ¥2  *=  *2  *2  -  Y1  >  V  ,  (2, 

Pr  (Y2  -  Yj  >  «j} 

which  upon  simplifying  yields 

Pr  {warhead  fuzing}  =  Pr  {Y^  -  Y2  <  and  Y2  "  Y1  < 

♦  Pr  {Yk+j  -  Y2  <  62  and  y2  -  Yi  >  •  (3) 

Defining 

a 

F(a)  =  /  f (x)dx ,  (4) 

-t© 

we  can  jirocccd  to  evaluate  the  first  term  on  the  right-hand  side  of 
Lquat  ion  3.  As  shown  in  Appendix  A  we  can  obtain  the  joint  probability 
density  function  of  the  1st,  2nd,  and  Kth  order  statistics. 


(5) 


f(yl*  y2*  yK}  “  (li-S) !  (N-K)  1  f(ylJ  f(y2J  tF(yK^  '  F(y2^ 

•  f(yK)  U  -  F(yK)]N’K 
If  we  let  u  *  yK  -  y2.  v  a  y2  "  yi»  am*  w  a  then  we  can  rewrite  Equation  5, 

f(u,v,w)  =  rKT3T?(N-10T  ftw)  f(v*w}  (F(u+v+w)  -  F(v+w)]K’3 

•  f(u*v+w)  [1  -  F(u+v+w)]^*^,  (6) 

and  the  desired  probability  is 

♦«°  6,  6. 

f  /  /  f(u,v,w)  du  dv  dw  (7) 

_«  o  o 

which  is  equal  to 

TK-3)T(TN~Kyr  [„  f(w)  C  £(V+W)  K2  [F(u+V+w)  *  F(v-w)]K'3 

•  f(u+v*w)  [1  -  F(u+v+w)]^~*  du  dv  dw  .  (8) 


In  an  analogous  manner  we  can  obtain  the  joint  probability  density 
function  of  the  1st,  2nd,  and  K*lst  order  statistics;  and,  letting 
u  =  yK+j  -  y2»  v  =  y2  -  y^  and  w  =  y^ ,  we  can  obtain  the  necessary 

probability  for  the  second  term  on  the  right-hand  side  of  Equation  3. 

That  probability  is  equal  to 

rrerrmWr  tM  C  f(v*’°  ^2|F(U*V*“)  -  F<V-“»K'2 

6i 

•  f(u+v+w)  (1  -  F(u+v+w) ]N'K"1  du  dv  dw.  (9) 

The  probability  of  fuzing  is  then  just  the  sum  of  Equation  8  and  Equa¬ 
tion  9. 

Appendix  B  contains  a  computer  program  which  evaluates  these  inte¬ 
grals.  In  its  current  form  it  assumes  that  the  distribution  of  the 
times  to  function  of  the  detonators  is  normal;  however,  it  can  be  easily 
modified  to  change  this  distribution.  The  program  requires  as  input  N, 

K,  6^,  62,  and  a  (standard  deviation  of  the  assumed  distribution). 


43 


4.  RESULTS.  For  the  problem  we  addressed,  the  value  for  o  was 

specified  to  be  10~^  seconds.  Figure  1  presents  the  results  of  changing 
6.  and  6-  for  a  fuzing  system  in  which  X  is  equal  to  N-l.  In  Figure  2_. 
we  considered  a  fuze  with  eight  detonators;  and,  keeping  a  equal  to  10*  , 
we  varied  K  as  well  as  6^  and  $2*  Of  course,  given  any  four  of  the 

five  input  variables  (o,  K,  N,  6j,  and  62)*  we  can  obtain  a  specific 

probability  of  warhead  fuzing  by  parametrically  varying  the  remaining 
variable. 


FIGURE  2.  PROBABILITY  OF  WARHEAD  FUZING  IN  A  K  OUT  OF  8  FUZING  SYSTEM  (o  «  .00001 ). 


APPENDIX  A.  As  shown  in  Reference  1  we  can  obtain  the  joint  probability 
density  function  for  Yj,  Y^,  and  Y^.  Let  Xj,  X2,  . . . ,  X  be  a  random 

sample  from  a  population  with  density  f(x)  ■  F’(x);  and  let  Yj,  Y2»  . ...  Yn 

be  the  corresponding  order  statistics.  Then  the  probability  density 
function  for  the  rth  order  statistic  may  be  derived  by  considering  the 
following  configuration: 


r-1 


n-r 


xl 


!x+Ax 


That  is,  X^  <  x  for  r-1  of  the  x  <  <  x  *  Ax  for  one  X^,  and 

Xj  >  x  ♦  Ax  for  the  remaining  n-r  of  the  X^.  The  number  of  ways  this 
combination  of  events  can  occur  is 

nl 

(r-1) 1  1!  (n-r) l  ’ 
and  each  such  way  has  probability 

[F(x)]rl  [F(x+Ax)  -  F(x)]1  (1  -  F(x+Ax)]n~r. 

Therefore,  we  have 

Pr  (x  <  Kr  <  x  .  lx)  .  Xirayr1  Cn-r^ 

•  (F(x)]rl  [F(x+Ax)  -  F(x) ]  (1  -  F (x+Ax) )nr  ♦  0(Ax2)  (Al) 

2  2 

where  0(Ax  )  means  terms  of  order  (Ax)  and  includes  the  probability 

of  realizations  of  x  <  Y„  <  x  ♦  Ax  in  which  more  than  one  X.  is  in 

r  1 

(x,  x  ♦  Ax).  Dividing  both  sides  of  Equation  Ai  by  Ax  and  then  letting 

Ax  0,  we  obtain 

fY  (x)  3  Tr^TTtwJT  lFCx))r_1  f  (x)  { 1  -F (x)  ]n  r.  (A2) 


In  a  similar  manner  we  can  derive  the  joint  probability  density 
function  of  Y  and  Yg : 


Ul 


and  through  an  analogous  argument  we  obtain 

fYplY#*X,y*  =  (r-l) !  (s-r-1) !  (n-s)  f  f(x^ 

•  [P (/)  -  Ffx)]3'1"1  f(y)  [1  -  F(y)]n"S.  (AS) 

Finally,  for  the  joint  probability  density  function  of  Y  ,  Ys,  and  Yt 


r-l 

X 

s-r-1 _ 

t-s-1 

n-t 

X 

x+Ax  y 

y+Ay  z 

i+hz 

and 

fYr,Y..Y  (x,y,z)  3  (r-l) !  (s-r-1)  I  (t-s-1) !  (n-t) !  fF(x^r  lf(x) 

*  [F(y)-P(x))S"r“l  f (y)  [F(Z)-F(y))t'S_l  f(z)  [1-F(z))n_t-  (A4) 

For  the  case  r»l,  s=2,  and  t=k,  we  obtain 

fYrY2.Yk(X'y'z)  =  rk-'3TRn-kyr  f(x)  f(y) 

•  lF(z)-F(y)]k'3  f(z)  [1-F(z)]n"k.  (AS) 
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APPENDIX  B.  We  are  presenting  here  the  computer  program  which  evaluates 
the  desired  probability  function.  As  noted  in  the  program  comments,  we  have 
assumed  that  the  times  to  function  of  the  detonators  are  normally  dis¬ 
tributed  with  mean  tero  and  variance  c2.  However,  with  just  a  few  changes 
to  the  subroutines,  a  different  distribution  may  be  assumed.  To  do 
this  all  cards  containing  "NORMAL"  in  columns  74  through  79  must  be 
replaced  by  others  with  the  appropriate  probability  density  function 
or  cumulative  distribution  function. 


nnnn  non  on  noooonoooooooooonnnnoono 


PROGRAM  OETON  (INPUT.  OUTPUT. TAPE5*1npuT.TARE6»0uTPUT) 


C 


THIS  PROGRAM  DETERMINES  THE  PROBABILITY  THAT  a  OUT  OF  N 

octonators  ■ ill  function  hithin  a  given  timespan.  k  rust 

BE  GREATER  Than  OH  equal  TO  3.  ANO  K  MUST  8E  LESS  THAN  OR 
EQUAL  TO  N.  THE  DISTRIBUTION  OF  ThEIR  FUNCTIONING  IS  ASSUMED 
TO  BE  GAUSSIAN  «ITh  A  MEAN  EQUAL  TO  0.0  ANO  A  STanOARO 
DEVIATION  EQUAL  TO  5IOMA.  THE  SOLUTION  IS  OERIVEO  THROUGH 
THE  USE  OF  OHOER  STATISTICS  AND  IS  OBTAINED  BT  EVALUATING 
A  TRIPLE  INTEGRAL. 

IF  THE  NUMBER  OF  DET  ON A  TORS  IS  GREATER  Than  THIRTY. 

THEN  THE  TOLERANCE  limits  OF  THE  INTEGRALS  MUST  BE  REDEFINED. 
THAT  IS.  THE  VARIABLE  'ERROR*  IN  Th£  MAIN  ROUTINE  AND  THE 
VARIABLES  *TOLR.TOLY*  IN  SUBROUTINE  *OIST*  ShOulO  BE  ADJUSTED. 
m ITh  THE  TOLERANCE  LIMITS  CURRENTLY  IN  THE  PROGRAM. 

THE  RESULTING  PROBABILITIES  APE  CORRECT  TO  Ti.0  DECIMAL  PLACES. 


INPUT  IS  AS  FOLLOWS  .... 

N  . . . 

K  . . . 

SIGMA  . . 

OELU  . 

OEuT  2  . 


NUMBER  OF  DETONATORS 

number  of  detonators  to  function 

STANOARO  DEVIATION  OF  DISTRIBUTION 
TIMESPAN  (FIRST  TO  SECOf.0  DETONATORS) 
TImESPAN  (SECOND  TO  LAST  0£ T ON ATOMS ) 


PEAL  NFACT.K2FACT.K3FACT,NKFACT.NK1FACT 


OtMENSION  lERfl(b) 

COMMON  /COM!/  n, a, SIGMA, OELU. OELT2 
COMMON  /COM2/  PI.m 
COMMON  /COM*/  INO 

EXTERNAL  OIST 

DATA  IERR  /3*(-0) ,0.2»<-0)/ 


C*LL  STSTEmc  (3A.IERR) 

CALL  STSTEMC  (US.  IERR) 

WRITE  (6.200) 

r£ao  Input 
initialise  variables 

5  READ  IS. 100)  N. K, sigma. OELTI .OCLT2 
IF  IK.LT.3  .OR.  K.GT.N)  GO  TO  15 
IF  <N  . GT .  30)  WRITE  (6.500)  N 
IF  (OELTI  ,3T.  10. 'SIGMA )  OELTI «1 0. *S IGMA 
IF  (OELT2  .GT.  10. ‘SIGmai  OElT2« 1 0 . «S ICMA 
«E$1»0. 

R£S2>0, 

PI  »3. 1*15026536 
NN*N/5*6 


SO 


IF  ( N  .EG.  SOI  NN«i2 
F;R»OH«10.»*  <-nn) 
a»io.»sigma 
*••10. 'SIGMA 

c 

c  evaluate  triple  integral 

c 

INO»l 

PESUSOUANR  (A.B.ERROR.RUN.OIST) 

I r  (N  .  EG  .  K I  GO  TO  T 

INO«2 

RE  S2-5QU ANK  (A.a.ERROR.RUM.OIST) 

C 

c  determine  factorials 

c 

T  CONTINUE 
nFACT«1 
K  2F  ACT* I 
K3FACT«1 
NKFACT-1 
nk1FACT«1 
00  10  I»1.N 
nFaCT-NFACT*! 


IF 

(I 

.l£. 

(K-2) 1 

K2F ACT  SK2FACT-I 

IF 

(I 

.lE. 

(K-31 ) 

K3FACT«K3FACT»I 

IF 

(I 

.LE. 

(N-K) ) 

NKFACTkNKFACT-I 

IF 

(I 

.LE. 

(N-K-l 

)  )  NKIFACT-NK1FACT*! 

10  CONTINUE 

c 

c  Compute  and  -RITE  PROBABILITIES 

c 

PROS 1=NF act/ (NKFACT-K  3FaCT ) -NES1 
oROB  2*NF  ACT  /  ( NK  IF  AC  T  *K  2F  AC  T  ’ -RES2 

PROBaPRORl *PR082 

wR  I TE  (6000)  N.K.SIOMA,OELTl.OELT2,PBOBl,PROB2,PaO<l 
00  TO  S 

IS  -«ITE  (6,400)  N,K 
STOP 

c 

loo  FORMAT  (2IS.3F20.T1 
200  FORMAT  I l M 1 // I 

300  FORMAT  (1M  »*HN  »  .I2.3X.4HK  ■  . I 2 . 3X . 8H5 I GMA  •  .F10.T.3X. 

•  BmOElU  ■  .FtS.e.3X.aM0ELT2  ■  .F15.0. 3X/1M0. 

•  3TMPR08A8ILITY  (1  THROUGH  K  FUNCTION)  ■  .F10.6/1H  » 

•  39HPROdA8 il I tt  (2  through  k*i  function)  -  .fs.*/1h  . 

•  22HPRORARILI TY  (TOtAL)  »  .F2S.6/////1 

400  FORMAT  (In  . .  INvalIO  VALUE  OF  N  OR  K  •••••»3X. 

•  «HN  ■  .I2.3X.4HK  a  ,12) 

500  FOfl“AT  (21H  •••••  -ARNlNU  -  N  •  ,12. 

•  51M,  CHECK  ThF  TOLERANCE  LImITS  CF  The  INTEGRALS  •••••///) 
END 

C 

c 

c 
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non  n  n  n  <->  n  n  non  r>  n  no  non  n  n 


FUNCTION  0 1ST  <*) 


THIS  ROUTINE  PROVIDES  ThE  INTEGHano  FOR  The  THIPO  INTEGRAL 

*s  •ell  as  the  limits  of  integration  for  the  second  integral. 

COhhON  /CON  1/  N.K.SIGHA.0ELT1.0ELT2 
COHMON  /C0H2/  f*I  » RN 
COHHON  /COM*/  INO 

EXTERNAL  FXX.FXY 

RMatf 

nn»N/5»6 

IF  <N  ,EQ.  50)  NN«1? 

TOLX«lO.»»(-NN> 

TOLY*l  <)•••<  -NN> 

?Hl wa l . /  (SORT (2.»PI ) »SlOHA i »ExP (- l . 1 2 . »S IGma««2) >  NORHAL 

IF  UNO  .£0.  2 )  GO  TO  S 

OOaNaO . 

UP«0ELTI 
GO  TO  10 
5  DORNaOELTl 
UP»2.»0ELU 

IF  (OELU  .LT.  10 . *S IGHA )  UPalO.aSIGHA 

10  CALL  OSLINT  ( UOaN . UP . FXX . F X Y , TOLA . TOL* ♦ ANS . RUM. R UHYm) 

0 1  ST*Ph  I'a»ans 

RETURN 

END 


SUBROUTINE  FXX  (V.YI.V2.Y3) 


THIS  ROUTINE  OROVIOES  Tn£  INTEGRAND  FOR  THE  SECOND  INTEGRAL 
AS  «ELL  AS  THE  LlHITS  OF  INTEGRATION  FOR  Th£  FIRST  INTEGRAL. 

COHHON  /COHl/  N,n, SIGMA. OELU.OELT2 
COMMON  /COH2/  RI,H 

PHlv*»l./(SQRT(2.«PI) *S IGHA 1 «EXP(-1.*(V»J **2/(2.*SIGHA»«2) )  normal 


YlsPHlVa 
Y2«0, 
v  3»0Et  T2 
uETuhn 
£hO 
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uuuu 


function  pat  <v«u> 


THIS  ROUTINE  PROVIDES  THE  INTEGRANO  FOR  THE  FIRST  INTEGRAL. 


C 


c 


c 

c 


c 


c 

c 

c 


COMMON  /CONI/  N«K«SI0NA»0EuTl»0El.T2 

common  /C0M2/  RI»“ 

COMMON  /COMA/  INO 

PMlUVV*! •/ ISORT  <2. «PI l “SIGMA) *EAP  CM »• <U*V»*1 **2/ (2.*SIGMA**8 1 ) 
CAPVWaFNO  ( (V««) /SIGMA1 
C APUVWaF NO  ( <U»V*PJ /SIQMA) 

IF  (INO  .EQ.  2)  60  TO  15 


IF  (N  .EQ.  31  60  TO  2 

IF  (N  ,E0.  A I  GO  TO  5 

IF  ( K  .EQ.  3»  60  '0  10 

FXY«(CAPvJVW-CAPVW>  ••(AM)  »PHlUVa*U 

GO  TO  25 
j  F*V«PMIUVM 

GO  TO  25 

5  FXta (CAPUVW-CAPVWl •• < A-3) •PMlUVP 
GO  TO  25 

l0  r*r-PHlUV«*(l.-CAPUVHl**(N-Al 

GO  TO  25 


-CAPuV*l»«(N-IO 


15  CONTINUE 


20 


IF  (N  .EQ.  (AMD  00  TO 
FXY* (CAPUV»-CAPyv I •• ( A -2  I •PNlUVa*  » 1 


,-CAPU/A) ♦♦(N-A-l) 


GO  TO  25 

FXV» (CAPUV«-CAPVVI 


(K-2DPHIUVP 


25  OfcTURN 
ENO 


NORMAL 

normal 

NORMAL 
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RADAR  ERROR  SIGNAL  IMPROVEMENT 


Robert  E.  Green 
Programs  Management  Office 
Instrumentation  Directorate 
US  Army  White  Sands  Missile  Range 
White  Sands  Missile  Range,  New  Mexico 


ABSTRACT.  Monopulse  tracking  radars  are  subject  to  pointing  errors 
that  are  induced  by  target-caused  signal  fluctuations.  These  signal  fluctu¬ 
ations  introduce  non-Gaussian  noise  into  the  angle  tracking  servo  error 
signals  of  the  radar.  This  paper  raises  the  question  of  the  efficacy  of 
making  corrections  for  these  errors  based  on  other  radar  measurements  that 
are  significantly  corrupted  by  noise.  Actual  radar  error  signals  are  dis¬ 
played  along  with  the  results  of  spectral  analysis  of  the  signals.  The 
spectral  analysis  confirms  the  presence  of  non-Gaussian  components  in  the 
error  signals. 

I,  INTRODUCTION.  One  of  the  types  of  devices  used  at  White  Sands 
Missile  Range  to  keep  track  of  missiles  in  flight  is  an  instrumentation 
radar.  These  devices  transmit  a  burst  of  energy  which  is  reflected  off 
the  target  and  back  to  the  radar.  Automatic  control  devices  in  the  radar 
keep  the  antenna  pointed  at  the  target.  The  measurement  of  the  time  inter¬ 
val  from  burst  transmission  to  echo  reception  permits  the  determination  of 
target  range;  and  the  position  of  the  antenna  pedestal  in  azimuth  and  ele¬ 
vation  permit  the  designation  of  target  position  in  polar  coordinates.  The 
radars  transmit  these  bursts  of  energy  at  rates  of  160,  320,  or  640  times 
per  second.  The  process  of  tracking  a  target  automatical  requires  that 
the  device  (radar)  sense  how  far  it  is  off  from  the  target  and  make  the 
necessary  corrections.  The  purpose  of  this  paper  is  to  define  a  problem 
that  has  been  detected  in  this  error  sensing  circuitry  and  solicit  sugges¬ 
tions  for  improving  radar  system  performance. 

II.  ERROR  SIGNALS.  Instrumentation  radars  of  the  type  used  at  White 
Sands  Missile  Range  utilize  a  monopulse  feed  system  to  generate  the  error 
signals  that  are  used  to  direct  the  tracking  mount  in  azimuth  and  elevation. 
This  monopulse  feed  system  uses  a  quadrangle  of  four  sensors  (Figure  1)  to 
determine  the  necessary  direction  to  drive  the  antenna  so  that  the  target 

is  centered  in  the  quadrangle. 


A  0  0  B 


C  °  0  B 


Figure  1.  Sensor  Quadrangle. 
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The  directional  error  is  measured  by  observing  the  difference  of  the  follow¬ 
ing  signal  levels: 

Azimuth  error  =  (A  +  C)  -  (B  +  D)  (1) 

Elevation  error  =  (A  +  B)  -  (C  +  D)  (2) 

These  sums  and  differences  are  formed  at  the  operating  radio  frequencies 
(5.5  GHz)  of  the  radar.  This  means  that  the  individual  signals  from  A,  B, 

C  and  D  are  not  available  for  processing.  In  the  radar  equipment,  these 
quantities  (1)  and  (2)  are  sensed  as  voltages  instead  of  digital  numbers. 

These  quantities  are  sensed  each  time  the  radar  receives  an  echo.  The  sig¬ 
nals  differenced  to  form  the  quantities,  (1)  and  (2)  are  of  almost  equal 
magnitude.  This  results  in  a  very  weak,  low  frequency  signal,  embedded  in 
a  large  amount  of  noise.  The  radar  equipment  uses  a  very  narrow  band  filt¬ 
ering  process  to  extract  the  signal  from  the  noise.  This  filtering  process 
is  accomplished  in  two  stages.  The  data  is  first  processed  through  a  low  pass 
filter  of  approximately  10  Hz  bandwidth.  The  filtered  signal  Is  then  applied 
to  the  servo  which  typically  has  a  bandwidth  of  approximately  3  Hz.  It  is 
not  possible  to  significantly  reduce  the  bandwidth  of  the  10  Hz  filter  and 
maintain  the  stability  of  the  servomechanism. 

The  noise  in  the  error  signals  has  two  major  contributing  sources.  One 
of  these  is  "thermal  noise"  and  is  naturally  occuring  in  the  environment. 

It  is  assumed  to  have  a  Gaussian  distribution  with  zero  mean.  The  other 
major  noise  source  is  contributed  by  the  response  of  the  radar  system  to 
changes  in  the  reflectivity  pattern  of  the  target.  The  reflectivity  pattern 
of  the  target  is  a  very  complex  function  that  changes  rapidly  with  changes 
in  target  aspect.  This  means  that  signal  amplitude  can  change  drastically 
between  two  successive  echos  from  the  same  target.  These  rapid  signal  fluctu¬ 
ations  are  the  origin  of  the  second  major  noise  source. 

III.  AUTOMATIC  GAIN  CONTROL.  The  gain  of  the  radar  receiver  performs 
the  same  function  as  the  volume  control  on  a  radio.  As  the  signal  gets  weak¬ 
er  the  volume  or  gain  is  increased  to  keep  the  output  at  a  constant  level. 

In  a  radar  system,  an  automatic  control  system  is  used  to  sense  the  average 
received  signal  level  and  adjust  the  receiver  gain  accordingly.  If  the  sig¬ 
nal  level  does  not  change  too  rapidly,  the  automatic  gain  control  system  main¬ 
tains  the  receiver  gain  at  the  correct  level  so  that  the  radar  system  func¬ 
tions  normally.  If  the  received  signal  level  changes  faster  than  the  auto¬ 
matic  gain  control  can  adjust,  then  the  noise  Is  Introduced  Into  the  error 
signals.  This  noise  is  deterministic  in  the  sense  that  if  the  receiver  gain 
setting  and  the  received  signal  level  are  known,  then  the  incorrect  value  for 
the  error  signal  can  be  predicted,  using  a  known  deterministic  function.  The 
previous  paragraph  indicated  that  the  radar  system  is  designed  to  respond 
only  to  low  frequencies.  This  would  make  it  appear  that  high  frequencies 
would  have  no  effect  on  the  system.  It  should  be  noted  that  the  radar  is  a 
sampled  data  system  and  that  frequencies  near  the  sampling  frequency  will 
appear  to  be  low  frequencies  due  to  aliasing. 


1 V .  ERROR  GRADIENT  CURVES.  The  error  gradient  curve  is  used  to  relate 
the  magnitude  of  the  error  to  the  voltage  sensed  (Figure  2).  This  function  is 
established  as  a  part  of  the  radar  system  set-up  procedure.  It  Is  also  used 
in  the  tracking  function  to  sense  how  far  the  target  is  off  center.  A  common 
set-up  would  be  to  assign  a  deviation  of  one  mill  1  radian  a  value  of  one  volt. 
Notice  that  the  curve  is  linear  in  the  region  where  the  deviation  does  not  ex¬ 
ceed  +1  milliradian.  This  is  the  region  where  the  radar  system  would  normally 
be  expected  to  operate.  In  the  radar  set-up  procedure,  the  deviations  shown 
in  Figure  2  are  assigned  for  the  voltages  sensed  in  the  azimuth  and  elevation 
error  signal  detection  circuitry.  These  values  are  correct,  as  long  as  the 
receiver  gain  is  set  at  the  appropriate  level  for  the  received  signal. 


Figure  2.  Error  gradient  curve 


The  automatic  gain  control  system  sets  the  receiver  gain  based  on  the 
average  value  of  the  last  few  signals  received.  When  a  received  signal  has 
a  significantly  different  amplitude  from  this  average,  it  has  a  profound 
effect  on  the  error  sensing  circuitry  of  the  radar  system.  Figure  3  Ulus* 
trates  how  the  error  gradient  curve  appears  to  the  error-sensing  circuitry 
when  the  received  signal  is  either  significantly  stronger  or  significantly 
weaker  than  the  average  value  over  the  last  few  samples.  Curve  A  illustrates 
how  the  error  sensor  reacts  if  the  received  signal  is  much  stronger  than  ex¬ 
pected.  In  this  case,  the  voltage  sensed  for  a  given  angular  deviation  is 
much  larqer  than  that  shown  in  Figure  2.  Curve  B  illustrates  how  the  sensor 
reacts  If  the  received  signal  is  much  weaker  than  expected.  For  this  condi¬ 
tion,  the  voltage  sensed  for  a  given  angular  deviation  is  much  smaller  than 
shown  in  Figure  2.  Since  Figure  2  was  used  to  calibrate  the  system,  these 
conditions  introduce  errors  in  the  sensed  angular  deviations.  In  actual  ex¬ 
perience  this  type  of  signal  fluctuation  occurs  frequently  in  tracking  targets 
such  as  missiles  and  aircraft. 
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V.  SPECTRAL  ANALYSIS.  The  Introduction  of  a  large  signal  fluctuation 
into  the  error  sensing  circuitry  of  the  radar  affects  the  system  for  a  period 
of  time.  This  is  due  to  the  action  of  the  filtering  process.  The  introduc¬ 
tion  of  an  impulse  into  the  system  lasting  only  one  sample  period  would  at 
first  appear  to  be  at  too  high  of  a  frequency  to  affect  the  system.  If  the 
amplitude  of  the  impulse  is  very  large  its  affect  will  be  spread  over  several 
samples  by  system  filtering  action  resulting  in  antenna  pointing  errors. 

Figure  4  shows  the  power  spectral  density  of  a  radar  error  signal  when  the 
radar  was  tracking  a  spinning  missile.  Note  there  is  power  at  low  frequencies 
near  the  servo  bandwidth  of  the  instrument.  Such  non-Gaussian  noise  will 
create  incorrect  responses  in  the  radar  angle  tracking  system.  Figure  5  shows 
similar  spectral  analysis  for  a  roll  stable  missile.  Notice  that  the  non- 
Gaussian  error  signals  occur  much  less  frequently  in  this  case.  This  indi¬ 
cates  that  rapid  changes  in  reflectivity  pattern  are  much  less  common  for  non¬ 
spinning  missiles.  Figure  6  is  a  recording  of  radar  error  signals  where  large 
signal  fluctuations  were  known  to  occur.  The  large  angle  errors  resulting 
from  these  signal  variations  are  evident  throughout  the  period  shown.  The 
data  presented  indicate  that  the  presence  of  large  short-term  signal  fluctua¬ 
tions  do  affect  the  radar  angle  error  signals. 

VI.  THE  PROBLEM  OF  CORRECTION.  The  composite  of  the  Gaussian  and  determ¬ 
inistic  noise  sources  results  in  a  function  such  as  the  one  shown  in  Figure  6. 
Observe  that  this  data  is  so  noisy  that  no  trend  can  be  discerned  by  Inspec¬ 
tion.  The  available  measurements  of  received  signal  level  are  also  very  noisy. 
In  spite  of  this,  it  is  possible  to  generate  the  required  set  of  error  gradient 
curves.  The  necessary  values  can  be  generated  by  fixing  the  radar  parameters 
and  then  observing  the  data  over  a  few  hundred  samples.  The  averages  of  these 
samples  have  the  expected  characteristics.  The  individual  samples  of  received 
signal  level  are  so  corrupted  by  noise  that  the  correction  of  individual  error 
signal  samples  may  not  result  in  an  improved  value. 

The  Information  presented  raises  the  following  questions: 

°  Will  applying  corrections,  based  on  functions  derived  from 
average  values,  produce  a  better  behaved  sequence  of  error  signals? 

0  Is  spectral  analysis  an  adequate  method  of  measuring  the 
improvement  resulting  from  the  correction  process? 

°  How  noisy  must  the  measurement,  of  signal  level  become  in 
order  to  make  the  correction  process  ineffective? 
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ABSTRACT 


The  Laplace  transform  is  an  extremely  versatile  tool  for  solving  differential 
equations.  The  s-plane  transfer  function  converts  the  problem  in  integration 
to  an  algebraic  one.  But  when  a  control  system  employs  digital  filters  in  an 
embedded  computer,  the  variables  are  necessarily  discrete,  and  the  problem  is 
better  stated  by  means  of  a  difference  equation.  The  operator  equivalence 
relationships  are 


Z  -  1  +  A  -  eD 

and  show  that  the  s-plane  transfer  functions  will  be  quite  complicated. 

To  circumvent  this  difficulty,  we  employ  the  Bilinear  Z- transform  which 
has  the  form 


1  F  1  ♦  BZ*1 
A  *  *  1  +  CZ  1 

and  maps  into  what  we  shall  call  the  r-plane,  where  the  transfer  functions  are 
well  behaved.  It  is  a  moat  useful  tool,  admirably  performing  its  mission  of 
simplifying  calculations,  but  seems  to  be  seldom  used  -  rarely  correctly. 

This  paper  re-examines  the  theory  of  the  Bilinear  Z-transform  utilizing 
two  new  parameters  (essentially  the  reciprocals  of  those  traditionally  used). 
It  is  felt  that  the  method  results  in  considerable  simplification  and  clarifi¬ 
cation. 

I.  BACKGROUND.  During  the  test  of  a  ballistic  missile  control  system,  an 
analysis  was  performed  of  the  digital  filters  used  in  the  attitude  compensa¬ 
tion  channels.  The  filters  were  implemented  by  cascading  two,  three  or  four 
stages  of  the  type 


JL.  .  (a  I 

(1  ♦  TZ-1)x 


(1) 


This  filter  is  implemented  in  a  digital  computer  by  the  following  two  succes¬ 
sive  steps: 


Xj  ‘ 

F1  "  u*l 


1*0 

(2) 

6X0 

(3) 
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Subscripts  refer  to  increments  of  time;  hence  the  sampling  interval  is  given 
by 


is  merely  a  convenient  computational  parameter. 

In  the  actual  case,  only  real  coefficients  were  encountered.  However,  the 
treatment  which  follows  can  be  extended  without  much  difficulty  to  inolude  the 
case  of  complex  coefficients. 

Several  shortcomings  were  found.  The  proximate  cause  is  easy  to  state  - 

incorrectly  computed  coefficients  and  too  many  stages  are  examples.  The  ulti¬ 
mate  cause,  however,  cannot  be  determined  with  any  certainty.  Perhaps  it  is  a 
lack  of  understanding  of  the  principles  of  digital  filters  on  the  part  of  both 
design  and  test  engineers. 

The  theory  of  digital  filters  is  the  theory  of  electrical  networks.  It  is 
not  surprising,  then,  that  the  bulk  of  the  literature  on  the  subject  has  been 
written  by  electrical  engineers,  and  is  couched  in  engineering  terms.  The 
frequency  domain  is  the  vehicle  of  thought. 

But  when  a  linear  control  system  is  operated  by  an  embedded  digital 
computer,  the  signal  to  be  sampled  often  is  virtually  aperiodic  in  nature, 
exhibiting  a  frequency  spectrum  that  is  quite  primitive  when  compared  with 
that  of  even  a  common  electrical  phenomenon.  Under  these  conditions,  restat¬ 
ing  matters  in  the  time  domain  results  in  worthwhile  simplification  and 
clarification. 

Accordingly,  the  theory  is  re-inveatigated,  utilizing  the  terminology  of 
the  time  domain.  Two  variables  are  identified  there,  and  expressed  in  units 
of  time  (e.g.,  seconds  or  sampling  intervals). 

II.  THE  PROBLEM  OP  FREQUENCY  FOLDING.*  Nearly  everyone  has  seen  in  the 
movies  the  spoked  wheel  which,  starting  from  rest,  turns  faster  and  faster 
until  a  speed  is  reached  (the  Nyquist  frequency)  where  the  spokes  appeer  to 
slow  down,  eventually  (at  twice  the  Nyquist  frequency)  coming  to  an  apparent 
halt.  As  the  actual  wheel  speed  continues  to  increase,  the  spokes  seem  to 
turn  backward,  and  the  phenomenon  is  repeated  in  mirror  image.  In  fact,  it  is 
repeated  indefinitely,  like  the  images  in  a  barber  shop  mirror.  The  frequen¬ 
cies  are  said  to  be  "folded,"  the  folds  occurring  at  odd  multiples  of  the 
Nyquist  frequency. 

The  Nyquist  frequency  is  equal  to  half  the  sampling  frequency.  Thus,  if 
the  movie  projector  operates  at  24  frames  per  second,  the  Nyquist  frequency  ie 
12  sookes  per  second.  It  is  convenient  for  theoretical  purposes  to  express 
frequencies  in  radians  per  second,  rather  than  in  hertz.  For  a  finite  sam¬ 
pling  interval  of  At  seconds,  then,  the  radian  sampling  frequency  is  2n/At 
and  the  Nyquist  frequency  n/A t. 


*J.  W.  Tukey  employed  the  term  "aliasing." 
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If  the  spectrum  of  a  signal  to  be  sampled  contains  frequencies  (of  suffi- 
cient  amplitude  to  be  detected)  greater  than  it /at,  frequency  folding  will 
surely  occur.  Having  occurred,  it  can  neither  be  detected  in  nor  removed  from 
the  sample.  Necessarily,  steps  to  be  taken  are  limited  to  preventive  ones. 
Three  cases  are  discussed: 

Case  1.  It  may  be  possible  to  decrease  At,  thereby  increasing  the 
Nyquist  frequency  until  it  spans  the  troublesome  frequencies. 

Case  2.  Unwanted  frequencies  can  be  removed  by  processing  the  signal  with 
a  suitable  band  pass  filter  before  sampling.  This  will  result  in  a  loss  of 
"power,”  which  perhaps  can  be  partially  compensated  for  in  the  subsequent 
digital  filter. 

Case  3*  Analysis  of  the  output  of  the  plant  (signal)  may  reveal  that  it 
is  aperiodic.  The  principal  frequency  thus  is  aero. 

In  all  three  cases,  the  function  (signal)  is  said  to  be  band  limited, 
since  its  frequency  spectrum  contains  no  frequencies  outside  the  band  defined 
by  the  Nyquist  frequency;  i.e.,  -jt/a t  c  ui  <  it/At. 

This  paper  will  treat  only  band  limited  functions.  However,  this  does 
not  mean  that  the  specter  of  frequency  folding  can  be  ignored,  since  either  a 
poor  filter  design  or  a  faulty  feed-ba^k  mechanism  can  induce  periodicity. 

III.  THE  BILINEAR  TRANSFORMATION.  In  its  most  general  form,  the  bilinear 
transformation  is  given  by 

Awz  +  Bw  +  Cz  ♦  D  -  0  (4) 

where  A,  B,  C  and  B  are  constants  while  w  and  z  are  variables,  any  of 
which  might  be  complex.  Avoiding  the  trivial  caBe  A  =•  0,  division  by  A 
obviously  does  not  disturb  the  equality.  Simplifying  thus,  let 


Then 


wz  +  yw  =  az  ♦  3 
w  ♦  ywz“l  =  a  ♦  (5z_1 


(5) 


w 


a  +  Sz~| 
1  +  yz  ~  ^ 


(6) 


It  is  equally  easy  to  solve  for  either  z  or  z_1  in  terms  of  w.  This 
demonstrates  that  the  inverse  of  a  bilinear  transformation  also  is  a  bilinear 
transformation. 


65 


The  operators  ~  ,  Z  -  1  ♦  6,  and  Z"1  -  1  -  7  submit  to  algebraic  manipula¬ 
tion,  so  that 


v  - 


g  »  SZ"1)* 
i  YZ'Ox 


(1) 


becomes  a  useful  digital  filter.  The  variable  %  is  ANY  parameter  that  can  be 
sampled  in  the  computer. 

As  will  be  seen,  the  filter  provides  an  approximate  solution  to  the  dif¬ 
ferential  equation 


dx 

dt 


f  (x,  t) 


since 


Z  -  1  +  A  -  exp  (^~) 

from  which 

-7  ■  (n  2  *  -  In  Z'1  ■  -  to  (l  -  V) 
dt  e  e  e 

Expanding  in  ascending  powers  of  7,  then  substituting  1  -  Z-1  -  7,  the 
infinite  series  in  Z-1  can  be  approximated  by  a  rational  function  in  Z  , 

The  set  of  Pade  approximants  is  a  convenient  source.  If  the  degree  of  numera¬ 
tor  and  denominator  are  chosen  to  be  the  same,  the  rational  function  can  be 
decomposed  into  factors,  each  of  which  hcs  the  form  of  filter  w. 

If  the  variable  w  is  used  to  define  an  output/input  ratio,  then  the  filter 
is  of  exactly  the  form  encountered  during  the  test. 

IV.  FILTER  CONSTRAINTS.  To  be  realizable,  the  filter  (rational  function) 
must  be  bounded.  That  is,  there  must  be  no  poles  at  infinity.  Obviously,  the 
degree  of  the  numerator  must  not  exceed  that  of  the  denominator. 

For  a  stable  filter,  under  a  conformal  mapping  into  the  r-plane  (described 
later),  all  poles  must  be  found  in  the  left  half-plane.  A  sufficient  condi¬ 
tion  seems  to  be  that  the  real  parts  of  all  denominator  coefficients  be  of 
like  sign. 
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V.  DBFIHITIOH  0?  TERMS, 


At 

♦  or 
F  or  Fj 

V*i 

G 


present  time 
previous  time 

tj  -  tQ  the  sampling  interval. 

Sometimes  referred  to  aa  one  Real  Time  Interrupt 
(RTI).  In  the  case  at  hand,  At  -  0,008192  aeo. 

the  input  at  t^  (f  •  Figure  1.) 
the  output  of  the  filter  at  tj 
momentary  gain  of  the  filter 
steady  state  gain  of  the  filter 


i+* 

For  this  discussion,  G  -  1,  with  no  loss  of  generality. 

It  is  profitable  to  investigate  the  response  of  the  filter  to  unit  step 
function  at  t^ ;  i.e., 


$n  -  0  (n  <  0) 

<t>n  -  1  (n  >  0) 

where  n  admits  only  of  integral  values. 

"huo 

tim  “  1 

i-*® 

We  shall  now  define  J,  the  toval  impulse  of  the  filter,  as 


l  “  Z  At  (F±  -  G^) 
which,  upon  substitution,  reduce  to 

1  a4  E  "  O  (7) 
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The  partial  impulse  of  the  filter  ia  defined  aa 

1  -  At  t  (?.  -  1)  (8) 

n  1  1 

When  n  ia  amall,  the  ratio  ia  useful. 

When  I_  *  4l,  that  time  can  be  oalled  the  half-life  of  the  filter, 
n  c 

It  ia  a  measure  of  the  filter's  responsiveness  (or  sluggishness). 

When  l  *  0,  there  is  no  net  feedback.  The  filter  is  a  smoothing  filter 
only  (possibly  a  very  good  one).  It  is  useless  for  control. 

When  I  >  0,  there  is  an  excess  of  output  over  input,  which  ia  available 
to  the  system  for  control.  The  amount  of  this  excess  defines  "total  impulse" 
in  a  useful  filter. 

When  I  <  0,  there  is  excessive  "power"  loss.  The  filter  allows  the 
system  to  drift  toward  instability. 

VI.  OSCILLATIHQ  FILTERS.  Repeating  for  convenience 


*1  •  *x  -  YX0 

(2) 

P,  r'  uXl  ♦  % 

(3) 

and  continuing  to  investigate  the  response  to  unit  step  function,  it  is  seen 
that 

X  -  0 
*0 

X  “  1 
*1 

X,  •  1  -  V 


X,  -  1  -  V  ♦  Y2 


X*  *  1  -  Y  *  Y2  -  Y3 

In  the  limit,  there  will  be  generated  an  infinite  series  which  converges  to 

1 


l  (  -  y)1  » 

i»0  1  4  v 


(9) 
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provided  |  Y  |  <  1 .  When  0  <  Y  <  1 ,  the  series  will  have  terms  of  alternating 
sign.  The  successive  values  of  will  oscillate  about  some  value,  as 
will  the  output  of  the  filter.  As  a  general  rule,  an  oscillating  filter  is 
not  desirable  for  control.  So  much  so  that  an  osoillating  filter  oan  be 
viewed  as  evidence  of  poor  design. 

When  -  1  <  Y  <  0,  the  series  for  (1  ♦  y)"1  will  have  terms  of  like 
sign,  and  the  filter  will  be  relatively  smooth. 

Notice  the  behavior  of  x  for  various  real  values  of  Y. 

Y  <  -  1.  The  successive  values  of  X  diverge.  The  filter  is  unstable. 

-  1  <  Y  <  0.  x  converges  to  the  value  ,  \  y.  The  filter  is  stable  and 
relatively  smooth. 

Y  -  0.  x  *  1  (constant).  The  output  of  the  filter  therefore  is  constant 
(«  *  8).  An  exception  occurs  at  ti,  where  Pi  •  «. 

0  <  Y  <  1.  X  converges,  but  oscillates  about  the  value  ^ The 
filter  oscillates  with  period  2dt.  This  is  equivalent  to  exactly  the  Nyquist 
frequency.  (See  Figure  2.) 

Y  ■  1 .  x  alternates  between  the  two  values  0  and  1 .  The  filter 
output  oscillates  between  a  and  3. 

Y  >  1.  x  and  the  filter  output  are  both  oscillatory  and  divergent. 

For  a  filter  to  be  both  stable  and  non-oscillatory,  the  coefficent  Y 
must  fall  within  the  range 


-1  <  Y  <  0 


VII.  THE  FUNDAMENTAL  IMPULSE  FORMULA.  Repeating  for  convenience 


Let  <  “  3/ a.  Then 


Clearing  of  fractions, 


w  3 


F  a  aZ  +  3 
$  3  Z  ♦  Y 


(1  +  YZ_1)F  -  a(l  + 


This  difference  equation  can  be  written 


F 

n 


♦  YF 


n-1 


a[  $  + 

•  n 


n-1 


I 


(1.1) 


(1.2) 


70 


’Trop-or '  T'jjjp*' ,  •  ‘r  yr’TIHfWffl'Ty 


$  *  1 
n 


(n  >  0) 


iim  _  , 

i**  i 


Hence 


1  ♦  y  ■  o  [ 1  +  k] 


From  the  difference  equation  ie  derived 


n  n-1  n  n-1 

If  M  I  F  -  o  [  £  $ .  +  <  l  $  ] 
l  i  o  i  1  1  0  i 


Since  Fo  *  to  "  °*  this  can  be  written  as 


n-1  n-1  n-1  n-1 

IP  ♦  F  m  £  F  *  a[  I  ♦  +  ♦  ♦  t  2  i  ! 

1  i  n  i  i  1  i  n  i  i 


(1  *  if)  I  F  -  a  (1  ♦  0  J  ♦<  -  a  6  -  f 

1  i  i  l  n  n 


hence 


1  +  y  ■  <*  (1  ♦  <), 


(1  ♦  y)  2  (F  ■  (  )  -a  *  • 
i  i  i  n 


Remembering  that  t  *  1  and  letting  n-*», 


?  <pi  -  ’>  ‘  TTl  ’ 


Making  the  substitution 


1  ♦  Y 
1  tc 


(10) 


I  (F  .  D  .  _L_  _  «  —1—  _  — 1— 

1  '  i  '  1  +  <  1+Y  1  ♦  Y  1+ic 


(12) 


For  two  oasoaded  stages 


p  (t  ♦  <1Z-1)0  ♦  *2z~l) 

♦  ‘  °ia2  (1  '♦  YiZ‘f)(l  +  Y2Z_l) 


(13) 


An  exactly  similar  development  yields 


1 


1 


OD  1  ^ 

1  (Fi  1)  i  +  <x  *  1  ♦  <2  "  1  +  Yj  "  1  ♦  Y2 


JLi__  _ _ JLz _ 


1  Y  i  1  ♦  Y2  1  ♦  1  ♦  <2 

The  subscripts  on  the  right  refer  to  filter  stages. 


(H) 


It  is  apparent  that  the  process  can  be  extended  to  any  number  of  stages  — 
say  j.  Thus  can  be  stated  in  general  form  (for  j  stages)  the  FUNDAMENTAL 
IMPULSE  FORMULA 

I  ■  At  S  (F.  -  D 
1  1 


At 


♦  ....  + 


At 


/  1  1 
\l  ♦  <i  +  1  ♦ 

rH' . r=~y 

(  V  .  .h.  .  .  _Ii 

V  ♦  Y1  1  ♦  Y2  *•”  1  ♦ 

*r) 


1  ♦  *  1  ♦  Y, 


Y.  1  ♦  <1 


1  ♦  <2 . 1 


Or,  since  1  ♦  Y  *  a  (1  ■*■<), 


(15) 

(10) 
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1 


•  «  •  • 


(16) 


r 

i 

I 

r 

r 


r 

t 


b 


I 


At 


a„  - 


*  TTT0 


•f 


For  a  single  stage, 


1  *  At 


(ID 


The  constraints  upon  y  imposed  by  the  requirement  for  filter  stability 
ensure  that  the  denominator  is  positive.  Hence  a  >  1  is  the  necessary 
condition  for  a  useful  control  filter  (1  >  0).  It  follows  immediately  that 
1  +  <  also  is  positive  and  that  y  >  <• 

For  a  given  value  of  tc,  a/(l  +  y)  remains  constant,  but  -1/(1  ♦  y) 
increases  with  increasing  y.  The  function  is  maximised  (for  the  allowable 
range)  at  y  *  1 •  We  choose  the  notation 


max 


For  j  stages, 

!max  '  H  (ai  *  a2  +  ••  *  “j  ' 

But  if  y  »  1 ,  a  -  1  ■  |  ~  ,  and 


,  at  (IrJi  1  -  *2  1  - 

^max  ”  2  \1  *  *  1  +k2  +  '*“  +  1  +  k  f 


(17) 


It  is  proper  to  think  of  as  a  boundary  condition.  But  lm„T 

is  clearly  unattainable,  since  the  boundary  is  not  included  in  the  (open) 
region.  It  will  be  found  that  1^  always  is  reduced  by  an  amount  which 
shall  be  called  the  attenuation  and  which  is  defined  by 


A 


max 


-  I 


(18) 


The  proper  dimension  is  some  unit  of  time;  e.g.,  seconds  or  RTI's. 

lmax  can  be  called  the  "desired  total  impulse."  It  appears  in  tha 
numerator  of  the  r-plane  transfer  function,  as  will  be  seen  later.  Since  it 
is  a  function  of  only  the  Kj'e,  the  latter  can  be  called  the  "Impulse 
coefficients." 


7  4 


VIII.  THE  ATTENUATION.  A.  Attenuation  is  defined  by 


I 


max 


l 


(18) 


Tvo  forms  of  tbe  Fundamental  Impulse  Formula  can  be  combined  by  simple 
addition  to  yield 


2 


Z1  -  *1  +  ]_2 

V1  4  <1  *  1  ♦ 


1  -  IC- 


1 


1  -  Y 


1 


1 


1  ♦ 


Y2 


*2 

-  ■■  --1) 
•  •  •  •  4  .  I 

1  *  *j/ 


1  +  Kj 


1  + 


Y 1 


Thus 


A  “ 


If  we  define 


(LUl  1  ~  y2  1 :  h\ 

\1  *  Yi  +  1  ♦  Y2  *  *  T  ♦  Yjj 

Aj  2  \l  ♦  Y j  / 

then  each  Aj  >  0,  due  to  the  limits  imposed  upon  y*  Further,  since 
[  >  0  (for  a  control  filter),  A  <  Imax‘ 


(14) 


(19) 


Therefore 

Imax  >  A  "  Ax  •"  A2  +  ’ 

The  attenuation  can  be  computed  separately  for  each  stage  and  the  parts  added. 


>° 


Notice  that  each  Aj  a  function  of  Yj  alone.  It  is  there¬ 
fore  proper  to  call  the  Yje  "coefficients  of  attenuation." 

For  a  single  stage,  let  us  observe  the  effect  upon  Aj  °f  various 
values  of  y • 


-  1  <  Y  <  0  implies  Aj  >  ~ 
Y  -  0  implies  A^  " 

0  <  y  <  1  implies  A^  <  ^ 
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The  filter  characteristics,  previously  stated  in  terms  of  Y»  can  now  be 
stated  in  terms  of  A*  For  j  stages,  then 

0  <  A  <  ^jAt  filter  oscillates 

A  ■  ^jAt  filter  finite  of  duration  jAt 

jjAt  <  A  <  filter  smooth 

A  “  I-.-*  zero  total  impulse 

USX 

As  A  approaches  I  max'  filter  becomes  more  sluggish. 

Particularly  note  that  a  requirement  that  the  filter  be  non-oscillatory 
places  a  finite  limit  upon  the  number  of  filter  stages.  This  limit  is,  of 
course 


J  < 


21 


sax 


At 


IX.  COMPUTING  THE  COEFFICIEHTS.  Por  a  single  stage 

A  -  A£  or  y  •  &t„-  2A 

H  2  M  ♦  y  Y  At  +  2A 


I  •  At  ,1  - 
‘max  —  (rr^) 


At  -  2] 


or 


At  ♦  2J 


MA 


max 


(19.0 

(17.1) 


Now  A  is  a  function  of  frequency  response  and  may  be  amenable  to  some 
adjustment.  Not  so  Ima**  It  is  the  "power"  demanded  of  the  filter,  and 
we  expect  to  deliver  only  l  of  it.  Can  we  recover  completely? 

The  answer  is  yes.  Instead  of 


A  -  I max  *  1  (I0) 

we  write 

A  -  (lfflax  +  A)  -  (I  +  A) 

or 

A  “  ( Imax  +  A)  Imax  (2®) 
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We  now  find  that 


or 


^max 


At  1  - 
2  1  +  < r 


it  -  21^  -  2A 

"  2I„,  *  2A 


(21) 


In  other  words,  we  enter  +  ft  into  the  formula  for  the  impulse 

ooeffioient  and  let  the  filter  attenuate  it  hack  to  I  max*  **  *8  possible 
to  do  thia  because,  in  the  time  domain,  so  many  of  the  terms  are  additive. 
Particularly  note  that  ft  must  be  determined  first. 


Now  1 mx  is  a  design  requirement  imposed  upon  the  filter.  Arbitrar¬ 
ily  augmenting  it  by  some  amount  (say  ft)  altera  nothing  in  principle.  The 
formulae  can  be  stated 


w.  .  l:  .  scl-lx 

4>  Z  ♦  Y 

k  '  *  $'/a' 


1  *  y  -  o '  { 1  ♦  ic ' ) 


(22) 

(23) 

(24) 

(25) 


where  the  primes  denote  the  new  values  resulting  from  the  augmentation  of 
Imax*  Note  that  aQd  y  are  independent  of  lnax  (as  is  ft)  and 
hence  are  not  primed. 


X.  COMPUTING  THE  COEFFICIENTS  FOR  A  TWO-STAGE  FILTER.  If  and  ft 

are  known,  a  single-stage  filter  is  uniquely  determined,  since  it  contains,  in 
essence,  only  two  coefficients  (y  and  «'). 


Suppose  the  taek  is  to  design  a  two-stage  filter.  Other  means  must  be 
found  for  determining  two  of  the  four  coefficients. 

The  first  step  is  easy.  For  a  non-oscillatory  filter, 

U,  > A  > 5^ 


If  we  set  Y  “  "  Y2'  then  ftj  -  ft2 

stage  will  be  oscillatory. 


2 At  and  it  is  ensured  that  neither 


IM,  *  a  •  (rTT^t  * 


(26) 
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requires  only  that  the  sum  within  parentheses  be  oonstant.  Solving  for  either 
*c'  in  terms  of  the  other 


-  p  *  <VUm. 

1  "  <',At  ♦  (1  ♦  ic'2)(L 


*  A) 

.  +  A) 


Allowing  k  '2  to  incre  ase  without  bound 


•(U ;  a> 

At  +  f  ♦  A 
‘max  H 


If  <  *  k  '2  *  then 


4t  ~  (l»».  •  A) 
4t  *  I»«!  *  A 


-  1  ♦  2P 


We  thus  establish  limits  for  the  <'*&• 

P  <  K\  *>  1  ♦  2P  *.  k'2 

It  is  observed  that  as  the  tc'^s  approach  1  ♦  2 P,  more  and  more  "power"  is 
delivered  at  the  first  RTI .  In  some  cases,  the  half-life  can  be  less  than 
At/2,  causing  the  filter  to  over-corre9t  and  generate  unwanted  noise.  This 
effect  is  most  marked  when  the  two  s  are  equal.  (At  1  ♦  2P,  of  course.) 

Little  case  can  be  made  for  a  half-life  less  than  At.  Using  this  as  a 
restriction,  and  noting  that  at  the  first  RTI 

T~  “  a',a',  (30) 


it  develops  that 


(a \a  '2  ~  1)  At  <  rj 


2 ‘max 


1  ♦  Y  *  a'  0  +  *')  - 


At  ♦  A 


allowing  us  to  develop  a  second  equation  in  the  s.  The  solution,  provided 
it  exceeds  1  ♦  2P(  will  furnish  a  practical  lower  limit  for  k',.  To  avoid 
negative  total  impulse  (for  the  stage),  use  an  upper  limit  of 

Often  it  is  effective  to  set  k'2  equal  to  unity.  A  useful  side  effect 
is  that  one  term  drops  out  of  the  1^*  equation,  since 

i  ♦  a  -  a* 

‘max  H  2  \1  +  <  1  ♦  1  / 
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and  thus  the  formula  for  k' 
filter  is  now  implemented  by 


ia  the  same  as  the  single-stage  formula.  The 


nm  a' a'  [  o  +  z-;.)  ~ 

0  i  2  [  (TT^rnT(i^z^T)  _ 


(32) 


Extending  the  method  (1  =  <‘2  ”  *'3  “  etc.)  to  filters  of  still  more  stages 
may  not  be  warranted.  The  additional  stages  will  be  smoothing  stages  only, 
and  the  resulting  filter  can  be  very  sluggish.  In  fact,  the  ratio  In/q 
may  actually  take  on  negative  values  for  the  first  few  RTI's,  a  most 
undesirable  characteristic  for  a  control  filter.  (See  Pigure  3.) 


■■  ■••••  •  ■  > 


In  order  to  illustrate  the  effect  of  varying  *  '2 ,  three  compensated 
two-stage  filters  were  synthesized  to  the  requirements 

1  *>  25  At 

max 


A  -  |*t 


<'2  was  arbitrarily  set  in  turn  to  the  values  0,  and  - 
Results  are  depicted  graphically  in  Figures  4,  5  and  6. 
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XI.  TRANSFORMATIONS  AND  CONFORMAL  MAPPINGS.*  A  useful  tool  developed  to 
simplify  the  solution  of  certain  differential  equations  is  the  Laplace 
Transform,  defined  by 

L  (F(x)  ]  *  /e”9X  d*  " 

0J 

provided  the  integral  exists.  Therefore,  the  Laplace  Transform  is  an  integral 
operator. 


♦In  this  paragraph,  j  *  /-\ 


The  complex  variable  a  is  of  the  form  s  •  o  ♦  jw,  a  and  u»  being  real. 
The  function  f(s)  ia  many-valued,  being  periodic  in  4jnw0,  where  n  is  any 
integer  and  wQ  ia  the  Nyquist  frequency  (often  expreased  in  radians  per 
second). 

For  band  limited  functions,  it  becomes  expedient  to  define  another 
complex  variable 


r  -  p  +  ju 


by 


rAt  .  .  sAt 

—  ■  tanh  ~Y~ 


(55) 


The  frequency  response  in  both  r-  and  s-planes  is  given  by  setting  p  M  o  *  0, 
from  which 


loAt  ,1u>At 

2“  *  tanh  ,d— g- 


-  j  tan  (-“**) 


Thus  *  tan  It  follows  that  for  ui0 

maps  into  the  entire  r-plane.  As  a  result,  the  new  variable  r  is  single- 
valued. 


the  s-plane  zero-atrip 
At 


u>  is  called  the  NATURAL  frequency. 

u  is  called  the  WARPED  frequency.  It  is  found  to  be  related  to  the 
attenuation  by 


2 

A  *  u 


(54) 


It  is  very  easy  to  demonstrate  that  the  transformation  £&£  ■  tanh  2Ai  is 

2  2  „ 

1  - 
- 


bilinear  in  Z- 1  and  r ,  since  Z  =  e8A*  and  tanh  x  -  t — 1—2 — 


e*  ♦  e“* 


1  ♦  e 


whence 

rAt  .  . 

2  ’  tanh 

The  inverse  transformation  is,  of  course 


1  -  zm\ 

1  +  z"i 


(55) 


7-1 


I  - 


1  *  “i 


(36) 


In  the  present  case,  the  form  of  the  filter  is  known,  and  is  expressed  in 
powers  of  Z-1  .  The  r-plane  transfer  function  is  recoverable  immediately 
by  direct  substitution. 
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it  is  seen  that  the  r-plane  transfer  function  can  be  written 

F' 

—  m 

4> 

It  should  be  clear  that  solution  in  either  the  r-plane  or  the  Z~  ^plane 
(which  two  are  connected  by  the  stated  bilinear  transformation)  is  easily 
implemented  in  a  digital  computer,  since  only  straightforward  arithmetic  is 

involved.  Not  so  in  the  s-piar.e,  where  logarithms  (e.g.,  tanh 

exponentials,  and  the  like  will  be  required. 


For  a  filter  of  J  stages,  if  Y  «  y  •  Y  •  ... 
...  -  tc'j  *  1,  the  r-plane  transfer  function  is 


Ll 

♦ 


0  ♦  rA)J 


Yj  and  k'2  -  <'3  - 

(39) 
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EXTENDED  ABSTRACT 

An  improved  procedure  for  treatment  of  so-called  censored  data  has 
been  developed  and  life-time  estimates  made  for  proof  tested  ceramic  rotor 
hubs,  in  addition  to  development  of  quality  assurance  control  of  powder 
metallurgically  produced  turbine  engine  discs.  These  represent  situations 
for  structures  to  perform  under  extreme  environmental  conditions  and 
analytical  procedures  to  aid  in  achieving  required  component  capability. 

Two  and  three  parameter  Lognormal  and  Weibull  functions  represent  the 
candidate  statistic  models.  These  functions  are  examined  for  best  repre¬ 
sentation  of  data  in  order  to  provide  flexibility  in  the  fitting  process. 
The  functional  parameters  are  obtained  from  the  maximum  likelihood  (M.L.) 
method.  This  method  provides  a  superior  representation  of  the  cyclic 
fatigue  data  as  compared  to  the  more  conventional  procedures.  The  M.L. 
method  can  also  provide  the  desired  confidence  limits  for  the  parameter 
and  reliability  determinations  associated  with  the  given  data  set.  The 
inadequacies  associated  with  the  method  of  moments,  graphical  procedures, 
etc.,  in  obtaining  the  functional  parameters  is  recognized  from  the  arbi¬ 
trariness  of  the  functional  representation  of  the  data.  The  acceptability 
of  these  methods  is  acutely  data  dependent. 

The  need  for  considering  all  data  including  censored  data  is 
established.  Both  lower  and  upper  bound  censored  data  are  considered 
as  they  relate  to  proof  testing  and  run-outs  respectively.  An  improved 
probability  of  failure  computation  can  be  obtained  when  the  total  data 
set  is  represented.  Partial  probability  ranking  procedures  tend  to 
introduce  substantial  errors  in  the  extrapolation  process  necessary  in 
obtaining  minimum  life-time  estimates.  By  including  censored  data,  one 
can  provide  a  more  complete  understanding  of  the  materials  capabilities. 

The  results  of  combining  the  M.L.  method  with  the  inclusion  of 
censored  data  are  compared  with  conventional  procedures  in  obtaining  both 
structural  reliability  and  material  probability  of  failure  computations. 
The  comparison  indicates  a  substantial  nonconservative  estimate  of  failure 
probabilities  can  occur  if  threshold  stress  values  are  obtained  from  proof 
testing  without  consideration  of  the  censored  data.  Application  of  the 
M.L.  procedure  provided  an  improvement  in  the  functional  representation  of 
data . 
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Introduction 


ANALYSIS  OF  Cf.’.SuM.D  DATA 


Oftentimes  in  procurement  of  structural  ceramic  components,  screening 
tests  are  employed  to  attempt  to  verify  component  (part)  quality.  Typical 
of  such  tests  are  room  temperature  (cold)  spin  tests  of  rotor  hubB.  Usual 
practice  calls  for  delivery  of  successful  spin  tested  parts  and  these  are 
then  treated  as  if  guaranteed  strengths  were  existent.  We  desired  to  more 
fully  exploit  the  information  gained  during  such  screening  tests.  As  an 
example,  during  the  conduct  of  screening  experiments,  failures  were  observed. 
Ordinarily  these  failures,  or  the  failure  data  is  not  reported.  It  is 
obvious  that  the  failure  rate  data  provides  useful  information  in  planning 
for  component  reliability  levels  and  is  necessary  to  establish  a  rational 
quality  assurance  plan.  Thus  this  study  explores  the  use  of  censored 
statistics  to  provide  reliability  estimates  incorporating  minimum  screening 
strength  levels,  and  also  the  failure  rates  (standard  deviations  ind.) 
associated  with  spin  tests.  The  influence,  for  instance,  of  5%  versus  10% 
failure  rate  during  screening  tests  are  documented.  Monte  Carlo  techniques 
are  employed  to  establish  desired  screening  test  procedures. 

Suppose,  for  instance,  fast  fracture  probability  of  failure  estimates 
were  made  using  conventional  Weibull  statistics.  In  this  instance,  the 
screening  level  is  treated  as  a  lower  bound.  However,  censored  data 
techniques  allow  taking  into  account  the  likely  component  failure  rates, 
based  od  the  observed  screening  test  data.  The  purpose  of  the  following 
calculation  is  to  compare  the  degree  of  conservatism  of  the  two  reliability 
estimates.  (It  was  observed  that  the  censored  data  technique  provides  the 
more  conservative  results.) 

These  comparisons  provide  confidence  in  using  the  screening  test  data 
itself  for  the  estimates  of  production  reliability.  The  implication  of 
these  results  is  that  continued  local  mechanical  strength  determination 
testing  can  be  minimized  and  lot  component  sampling,  coupled  with  spin 
tests  can  be  adopted  to  insure  hardware  reliability.  Treated  random  sample 
selection  from  lots  can  predict  corresponding  failure  of  total  lot.  It  is, 
therefore,  important  to  consider  failure  below  minimum  load  level. 

In  representing  fatigue  data  were  run-outs  (non-feiled  specimens 
tested  at  predetermined  number  of  cycles)  exists,  it  i:  usually  necessary 
to  apply  graphical  methods  in  determining  prescribed  probabilities  and 
their  corresponding  cycles  to  failure.  The  graphical  approach  requires 
representation  of  only  the  failed  results.  The  remaining  data  is  included 
only  in  representing  the  ranking  of  the  data.  This  method  is  often  sus¬ 
ceptible  to  error  because  of  the  arbitrariness  that  exists  in  interpreting 
an  acceptable  regression  line  for  ranked  data.  Optimum  coefficient  methods, 
for  example,  will  introduce  sizable  variation  in  slopes  such  that  a  unique 
threshold  value  for  function  becomes  very  difficult  to  determine.  Since 
the  extrapolation  of  the  regression  line  provides  the  necessary  probability 
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number  it  is  therefore  critical  that  the  slope  of  this  line  be  properly 
determined.  If  all  data  is  considered  including  run-outs  then  a  censored 
data  analysis  procedure  must  be  applied.  The  present  analysis  outlined 
in  this  text  applies  this  analysis  including  the  appropriate  M.L.  pro¬ 
cedures.  The  Weibull  and  Lognormal  distribution  were  candidate  functions 
since  they  are  usually  the  most  acceptable  representation  of  fatigue  data. 

A  comparison  between  graphical  results  and  that  of  the  M.L.  censored  data 
method  indicate  substantial  differences.  The  graphical  results  showed 
highly  non-conservative  estimates.  In  this  instance  the  component  material 
would  have  been  rejected  instead  of  accepted  as  indicated  by  the  censored 
data  method. 

Following  is  a  general  description  of  the  analytical  technique  developed 
in  treating  the  problems. 

Weibull  Function 


The  Weibull  function  has  been  commonly  used  in  failure  prediction  of 
ceramic  and  high  strength  fatigued  materials,  It  was  determined  from  the 
analysis  of  the  rotor  disc  and  helicopter  component  data  that  the  best 
representative  function  was  also  Weibull,  therefore,  the  M.L.  analysis  of 
censored  data  for  this  function  will  be  developed  in  this  paper.  The  Weibull 
probability  density  function  of  the  random  variable  X  is 

f(X|ou.  o0,  m)  =  [m(X-ou)IT'‘1/ooin]  exp  { -  [  (X-ou)/oo]m) 


where  a  ,  m>0  and  X>0  >0 
o’  -  u- 

o  ,  o  and  m  are  the  location,  scale  and  shape  parameters  respectively. 
T&e  log  of. the  Weibull  likelihood  function  for  dual  censoring  can  be 
written  as1  J 


LrtL  -  LnN!  -  Lnr!  +  (N  - r )  ( Ln m - pjLn o  )  -  Ln(N-N  ) 

o  o  o 


No  N0 

+  (m-1)  l  Ln(X.-o  )  -  [ 

i=r+l  i=r+l 

+  rLn{l-exp[-(Xr+1-ou)m/oo,T']}  -  (N-N0)[(XN  "Ou)/oo^ 


(2) 


where 

N  =  total  number  of  data  points  including  the  censored  values, 
N  =  number  of  values  prior  to  run-outs  (fatigue  data) 
and  ° 

r  =  number  of  data  values  less  than  the  proof  tested  value 
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The  M.L.  equations  are  determined  from  the  partial  derivative  of  LnL 
with  respect  to  the  three  parameters  set  equal  zero.  That  is, 


where 


3  LnL 

55“  =  0 

u 


~  »  -m'NrJ/o^m  [  (X  -o  )®/o  B+1 
o  i=r+l 

3l.nL  W1  JJ  N 

•  C-PHi  '  L"  »0)  *  I  U(Xro  )  -  J 

i  =  r*l  i  =  r* 1  1  u  u 

Lnt(Xi-ou)/o(j]  .  '■(Xr.rou)"Ln[(Xr>1.ou)/oo]exp(-[(Xr>1-„u)/0o]")/ 
t»0"<l-.xpt-(Xr  -,  )>  »])] 


3  LnL  ? 

-sr*m  (: '•*>J.1CV<’„> 
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'■ '5pC-  (xr. f -««pC-  (Xr. , )] 


*Note:  When  N  =  H  and  r  f  0  then  lower  censoring  is  applied  as  required 
for  ceramic  disc  analysis.  If  r  =  0  and  N  f  N  then  upper  censoring  is 
used  in  evaluating  fatigue  results  for  the  helicopter  component.  The  result¬ 
ant  equations  above  are  for  the  case  where  N  =  N  . 
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\n  iterative  procedure  has  been  developed  for  determining  the  M.L.  parameters 
6  ,  6  and  m.  Initial  estimates  are  obtained  from  the  moment  method  without 
censoring.  From  these  estimates  each  parameter  is  determined  one  at  a  time 
in  cyclic  order  in  equation  3  until  reasonable  convergence  is  obtained.  At 
each  step,  the  rule  of  false  position  is  used  to  determine  values  which 
satisfied  the  likelihood  equation  with  prior  estimates  of  the  other  parameters 
remaining  constant.  Note,  if  a  large  percent  (greater  than  50)  of  censored 
values  exist,  then  it  is  necessary  to  gradually  increase  the  amount  of 
censoring  in  order  to  obtain  desired  convergence.  If  a  two  parameter  Weibull 
function  is  desired  then  omit  ou  in  the  computation  process. 

Lognormal  Function 

Although  the  Weibull  extreme  value  function  is  commonly  applied  in 
representing  ceramic  strength  data,  the  lognormal  function  can  provide 
an  option  if  the  Weibull  function  is  not  acceptable.  The  lognormal 
function  has  been  included  in  the  evaluation  procedure.  The  likelihood 
function  is  defined  as:  J 


f(X 


N! 


r+1  ’ 

N„ 


(N-NJ.'r!  11 


.  1  1  .  o<^2n  (X  .  -t) 

i  =  r+i  l 


Xjj  U,a, t) 

exp  {- 


N-N 


1  -  F[Zn  ])  °  {F(Zr+1]}r 


N0  [ Ln(X . -t) -w] 

I  “ 

i=r»l 


2o 
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where  t  is  the  location  or  threshold  parameter  and  m,  a  are  mean  and 
standard  deviations  respectively.  N.  r  and  N  are  defined  in  the  Weibull 
analysis. 

The  function  F  is  defined  as: 


Z. 

F  «  /  1f(t)dt 


where 


and 


Zi  =  [Ln(Xi-t)-u]/o 
f(Zj)  =  exp  (-Z.2/2) 
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The  complete  development  of  the  H.L,  function  for  the  log  normal  is 
omitted  since  it  is  similar  to  that  developed  for  the  Weibull  function. 

A  much  more  severe  convergence  problem  exists  in  determination  of  Lognormal 
parameters,  particularly  for  50%  or  more  censoring.  Therefore,  it  is 
important  to  obtain  reasonable  initial  estimates  in  addition  to  introducing 
small  increments  of  censoring  until  the  desired  amount  is  obtained. 


Duality  Assurance  of  Rotor  Discs 


The  failure  prediction  procedures  described  previously  where  applied 
to  data  obtained  from  both  spin  and  flexure  tests  of  rotor  discs.  Tests 
were  made  in  order  to  establish  quality  assurance  of  the  disc  material  prior 
to  manufacture  into  ceramic  engine  rotors.  The  spin  test  is  applied  initially 
in  order  to  guarantee  a  minimum  strength  level  for  the  disk.  This  lower  bound 
(threshold  strength)  was  obtained  from  spinning  the  disk  at  an  angular  velocity 
of  60,000  rpm.  The  equivalent  forth  point  flexure  test  results  are  350  N/mrn 
at  rim  section  (R^  Figure  3)  of  disc,  this  stress  value  is  obtained  as  shown 
in  Figure  A. 
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Subsequent  flexure  tests  were  conducted  on  the  surviving  disks.  The  test 
specimens  are  selected  from  locations  outlined  in  Figure  3- 

The  Results  of  the  Censored  Data  Analysis  Procedures 

Data  is  analyzed  for  flexure  specimens  obtained  from  ring  1  of  disk 
(see  Figure  4)  in  order  to  be  consistent  with  the  threshold  strength  level 
obtained  from  spin  test  at  this  same  location.  Since  there  was  a  limited 
amount  of  data  from  available  disks,  it  was  necessary  to  generate  additional 
data  in  order  to  demonstrate  the  effects  of  the  censoring  process  as  related 
to  failure  predictions  of  the  material.  The  censored  data  relates  to  the 
number  of  failed  discs  resulting  from  spin  tests.  The  remaining  data  was 
obtained  from  selecting  randomly,  values  generated  from  the  Weibull  func¬ 
tional  representation  of  flexure  results  in  ring  1. 

Initially  is  was  assumed  that  flexure  data  was  obtained  from  100  rings 
without  consideration  or  knowledge  of  the  number  of  failed  discs  in  spin  test. 
A  plot  of  the  ranked  data  (flexure  strength)  and  the  corresponding  Weibull 
functional  representation  is  shown  in  Figure  5-  The  RMS  error  tabulation 
determines  the  best  functional  representation  and  is  defined  as: 


N  -  . 

RMS  -  [£  (R.(X.)  -  F  (Z.))VNr 
i=l  •* 


(5) 


where 


o  „  •  5  n  _  * ~  •  ^  j  p  _  i 

R1  N  *  R2  '  N+.4’  d  R3  ~  N*T 


i  -  1.  2,  3, 


and 


N  *  sample  size 


F  is  the  cumulative  density  function  selected  from  the  four  candidate 
functions  (normal,  lognormal,  Weibull  and  the  radical  function).  See 
Reference  4  for  details  regarding  radical  function.  In  Figure  5,  the  radical 
function  was  the  best  fit  with  Weibull  the  next  best  representation.  The 
mean  and  standard  deviation  is  also  tabulated  with  their  corresponding  90% 
confidence  intervals.  In  box  the  labelled  Weibull  parameters,  the  dispersion 
scale  (char,  value)  and  threshold  value  o  (origin)  are  displayed  with  90% 
confidence  intervals,  o  intervals  are  not  as  precise  estimates  as  those  for 
the  other  parameters. 
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Figure  5 

The  99%  origin  number  is  the  99%  probability  of  survival  value  with 
adjacent  number  being  the  lower  95%  confidence  limit.  The  latter  number 
represents  t,he  so-called  design  A  -  allowable.  In  this  instance  it  is 
368.35  N/mni  .  The  radical  parameters  are  also  tabulated  but  will  not  be 
described  in  this  text  (see  Reference  4).  The  box  for  non-parametric 
solution1  provides  for  design  A  and  B  allowables  when  parametric  repre¬ 
sentation  of  the  data  is  not  acceptable.  The  optimal  tabulation  indicates 
300  and  30  data  points  are  necessary  for  the  A  and  B  allowables.  Instances 
where  99  data  points  are  available,  a  99%  survivability  point  (smallest 
stress  value)  has  a  63%  chance  of  being  correct.  The  B  allowable  (90% 
survivability)  has  at  least  a  95%  guarantee  of  being  correct.  The  design 
A  allowable  determined  from  Weibull  function  wall  be  of  primary  consideration 
in  evaluating  the  effects  of  censoring  data  as  it  relates  to  hypothetical 
failure  rates  of  the  discs. 
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The  results  from  Figure  5  essentially  describe  the  failure  prediction 
of  the  disc  when  with  additional  data  (spin  test)  omitted.  There  can  be  a 
serious  problem  existing  if  this  spin  test  result  is  omitted  since  the 
test  only  prescribes  material  strength  guarantee  for  small  regions  of 
the  disc.  Therefore,  it  is  essential  to  recognize  that  both  spin  teBt  and 
flexure  tests  are  material  strength  characterization  tests.  Figure  6 
presents  the  results  from  a  10%  failure  rate,  that  is,  assuming  spin  test 
resulted  in  10%  of  total  number  of  discs  failed.  In  this  case,  the 
A-allowable  is  297. 59  N/mm  ,  a  sizeable  reduction  from  tbe  case  were  not 
spin-test  failure  existed.  Lognormal  was  the  other  candidate  function  but 
did  not  provide  the  best  representation  of  data.  Figure  7  shows  the  results 
for  20%  failure  with  a  resultant  A-allowable  of  230.42  N/mm  .  If  a  failure 
rate  of  30%  existed  then  A-allowable  would  be  194.57  N/mm  as  noted  in 
Figure  8.  The  data  is  not  well  represented  in  this  case,  but  if  no  alternative 
is  available  then  these  results  will  provide  the  necessary  conservatism  in 
contrast  to  omission  of  the  censored  results.  The  effects  of  ignoring  spin 
test  failures  as  they  relate  to  censored  disc  data  is  obvious,  therefore,  it 
is  important  to  consider  all  test  results;  both  flexure  and  spin  tests.  In 
Figure  9,  a  plot  of  probability  of  survival  versus  RPM  is  shown.  Note  the 
reduction  in  allowable  RPM  when  spin  test  failures  have  been  considered.  For 
example,  if  30%  failure  rate  existed,  then  95%  survivability  of  additional 
discs  would  limit  tbe  maximum  speed  to  40,000  RPM. 


Figure  9 

Fatigue  Data  Evaluation  (Helicopter  Engine  Component) 

The  low  cycle  fatigue  data  with  upper  censoring  (run-outs)  shown  in 
Table  I  was  obtained  by  strain  control  node  of  testing  where  total  axial 
strain  range  is  the  controlled  parameter  being  held  constant,  The  material 
is  a  HIP  Rene  95  powder  metal  used  in  helicopter  engine  components. 

MTIOE  TBT  RESULTS 


Iterf.g)  Fhcn  Hm*. 

£baawO»T* 


i 

Rh*  <i/*l> 

Cicu*  (Uiiurt) 

Uc  Ctcui 

1 

.009 

066 

1.90 

2 

.ax 

11651 

*.066 

1 

.fit 

16281 

*.212 

4 

.on 

1 MS 

*.216 

5 

.m 

17*13 

4.2*1 

6 

.115 

174J4 

4.2*1 

7 

.is 

17761 

4.2* 

t 

17B 

4.54 

9 

.171 

XB66 

4.«9 

10 

.IS 

SV6* 

4.491 

11 

.212 

SUB 

4.5ff 

12 

.2)1 

SB 

4.589 

13 

.250 

*ue 

4.65 

14 

.266 

V177 

4.674 

15 

.31 

*7X9 

4.05 

16 

.307 

vsa 

4.660 

17 

.127 

VW3 

4.661 

S  .961  47TB  n.641 


TABLE  I 


95 


Initially  the  ranked  data  tabulated  in  Table  I  was  plotted  on  Weibull 
probability  paper  (see  Figure  10).  A  regression  line  was  obtained  from 
an  optimum  condition  coefficient  results.  The  95%  confidence  linits  for 
the  line  are  shown  in  the  figure.  The  3.54  cycles  designation  deacribes 
the  99%  probability  for  a  larger  log  cycle  to  failure.  That  is,  there 
is  a  one  percent  chance  for  the  log  cycles  to  failure  to  be  leas  than 
3-54.  The  95%  confidence  limit  was  determined  in  conjunction  with  the 
99%  greater  cycles  to  failure  in  order  to  describe  an  A-allowable  for 
fatigue  strength. 
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Figure  10 

In  Figure  11,  a  plot  of  the  ranked  data  with  the  corresponding 
functional  relation  are  shown.  Functional  parameters  were  obtained  from 
the  censored  data  analysis  described  previously.  The  failure  to  represent 
lognormal  results  was  due  to  excessively  large  percent  of  censoring.  This 
could  have  been  corrected  if  partial  censoring  was  implesiented .  The  two 
parameter  Weibull  functions  excellent  representation  discouraged  the  need 
for  this  modification.  The  relatively  small  RMS  values  for  the  Weibull 
function  are  consistent  with  results  for  the  small  residuals  noted  in  the 
graph  (see  Figure  11).  The  broken  line  representing  the  graphical  method 
results  shows  relatively  poor  representation  of  the  data.  This  was  not 
noticeable  from  the  graphical  plot  of  data.  Although  this  poor  represen¬ 
tation  occurs  in  this  instance,  other  sets  of  censored  data  were  well 
represented  by  the  graphical  procedure.  The  problem  exists,  in  that  the 
graphical  method  is  not  consistent  in  providing  a  good  representation. 

The  M.L.  method  applied  to  the  censored  case  invariably  results  in  a 
desirable  data  representation.  A  tabulation  of  the  A  and  B  allowables  are 


shown  in  the  figure.  Note  the  relatively  large  differences  in  the  results 
from  the  two  methods.  The  consequence  of  this  overly  conservative  estimate 
from  the  graphical  procedure  can  result  in  unnecessary  rejection  of  a  very 
expensive  engine  component.  Figure  12  shows  the  results  from  another  set 
of  data  where  a  considerable  large  amount  of  censoring  exists.  Note,  the 
excellent  representations  of  this  data  by  the  H.L.  method. 


FIGURE  II 
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CONCLUSIONS 


Tbe  analytical  treatment  of  truncated  data  obtained  from  the  ceramic 
rotor  has  been  discussed  in  some  detail  since  the  results  have  important 
implications  regarding  use  of  proof  testing  and  qualification  data. 
Furthermore,  calculations  of  the  type  presented  herein  are  of  importance 
for  establishing  meaningful  production  lot  sampling  procedures  which  rely 
on  limited  quality  assurance  and  spin  test  data.  It  was  evident  that 
neglect  of  statistical  information  contained  in  the  spin  test  failure 
rate  data,  leads  to  non-conservative  mathematical  representations  for 
material  behavior. 

The  previous  discussion  obviously  represents  one  narrow  facet  of 
analyzing  failure  of  components . Thus  far  nothing  has  been  said  concerning 
time  dependent  failure  response  of  structural  ceramics.  It  is  worthwhile 
commenting  on  studies  directed  towards  the  objective  of  understanding  such 
phenomena.  While  fairly  extensive  data  is  available  to  the  designer  which 
permits  materials  choices  for  particular  applications,  it  is  worth  noting 
that  many  of  the  inherent  mechanisms  which  produce  microstructural  and 
physical  and  chemical  changes  are  not  fully  understood.  It  is  not  possible 
at  this  time  to  present  a  comprehensive  mathematical  model  for  all  ceramics 
which  accurately  accounts  for  all  controlling  materials  phenomena,  such  as 
the  physical  changes  induced  under  severe  environmental  limits,  as  well  as 
creep,  slow  crack  growth  and  other  aspects  of  time  dependent  behavior. 

An  accurate  determination  of  a  prescribed  probability  for  specific 
minimum  of  cycles  to  failure  of  the  Rene  95  material  can  be  realized  if 
all  data  is  censored,  represented  by  the  Weibull  function  where  corresponding 
parameters  are  described  by  M.L.  methods.  The  uncertainty  involved  in  the 
graphical  approach  should  be  avoided.  The  argument  that  it  is  easier  to  use 
is  not  valid  at  the  present  time.  Tbe  simplest  programmable  calculator  can 
provide  the  necessary  computation  for  the  M.L.  method.  Although  probabilistic 
life  estimates  have  been  made  for  tbe  previously  mentioned  material,  the  M.L. 
method  can  also  be  applied  to  most  other  fatigue  data. 
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ABSTRACT 1  Many  military  Installations ,  as  well  as  industrial 
facilities,  conduct  operations  which  can  present  a  safety  hazard 
to  personnel,  property,  vehicles,  industrial  facilities,  and 
communities  that  lie  in  the  neighborhood  of  the  installation* 

This  report  considers  a  military  installation  which  tests  missiles 
and  rockets,  as  an  example,  and  discusses  procedures  for  estimating 
risks  from  these  operations  to  neighboring  facilities*  The 
procedures  of  this  report  should  also  be  applicable  to  many  other 
types  of  operations  that  are  found  on  a  variety  of  military 
installations  and  with  a  number  of  Industries*  Estimates  of 
risk  will  be  provided  for  certain  critical  points  and  also  in 
the  form  of  contour  maps,  which  will  show  the  risks  for  the 
entire  region.  Methods  for  obtaining  confidence  limits  for 
these  risks  will  also  be  discussed*  Finally,  some  suggestions 
are  offered  regarding  comparisons  of  risks  from  operations  to 
every  day  life;  and  from  these  comparisons,  it  may  be  possible 
to  decide  whether  the  risks  from  an  operation  are  sufficiently 
small  to  be  accepted. 


1.  INTRODUCTION! 

a*  Operations  conducted  by  many  military  installations  may 
cause  safety  hazards  to  personnel,  property,  vehicles,  ind¬ 
ustrial  facilities,  and  communities  in  the  neighborhood  of 
the  installation.  These  operations  Include  the  testing  of 
rockets,  missiles,  airborne  targets,  aircraft,  explosive 
devloes,  materiel  emitting  radiation,  etc.  It  is  the  purpose 
of  this  report  to  discuss  a  few  methods  for  estimating  the 
risks  created  by  military  operations  to  neighboring 
facilities* 

b.  Specifically,  consider  a  military  installation  with  the 
primary  mission  of  testing  rockets  and  missiles,  and  the 
risks  that  may  occur  as  a  result  of  a  malfunctioning  round 
flying  off-course,  impacting  in  an  undesired  location,  and 
causing  serious  damage  to  an  industrial  facility  located  at 
the  unplanned  impact  point.  The  methods  of  this  report  are 
easily  extendable  to  other  types  of  military  installations, 
various  types  of  industrial  operations,  and  a  variety  of 
possible  targets* 


♦This  is  a  condensation  of  the  original  report.  A  copy  of 
the  complete  report  may  be  obtained  by  writing  to  the  author 
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c.  The  purpose  of  this  study  may  be  to  consider  the  risks  upon  one 
(or  possibly  two  or  three)  speoific  target.  This  target  nay  be 
an  industrial  plant,  a  town,  a  highway,  etc.  On  the 
it  nay  be  desired  to  learn  the  riBks  at  «y«y  point  lyir«  within 
the  region  surrounding  the  military  installation.  If  it  is  tne 
latter®  the  end  product  of  the  study  may  be  i * ■ 

area,  with  oontour  lines  indicating  the  probability  that  during 
any  12  month  period,  a  malfunctioning  object  may  strike  at  a 
point  along  the  line  and  do  damage  greater  than  at  some  specified 
level.  The  reasons  for  studying  the  risks  for  an  entire 
neighborhood  lnoludei  (1)  The  entire  region  around  this 
installation  may  be  covered  with  industrial  plants,  farms  and 
ranches,  communities,  highways,  and  other  points  of  concern; 
or  (2)  a  company  may  want  to  locate  a  plant  somewhere  in  the 
region  around  the  installation  and  will  want  to  know  which  areas 
are  safe  enough. 


d.  A  primary  working  tool  for  this  study  is  a  set  of  maps.  These 
maps  will  cover  the  entire  region  of  concern.  They  will  be  in 
black  and  white,  will  show  very  little  detail,  but  will  show  the 
boundaries  of  the  test  facility,  major  highways,  larger  communities 
and  points  of  interest.  These  maps  will  also  contain  referenoe 
polnte,  which  may  be  thought  of  as  the  points  of  intersection  of 
equally  spaced  vertical  and  horizontal  lines.  The  closeness  of 
these  reference  points  will  depend  upon  the  accuracies  desired  and 
the  amount  of  work  that  one  wishes  to  do  when  evaluating  the  data. 
Figure  one  is  an  example  of  such  a  map,  with  reference  points 
located  10  ml.  x  10  ml.  apart.  Actual  working  maps  should  be  two 

to  three  times  as  long  and  wide  aa  figure  one. 

e.  The  target  that  will  be  used  as  the  example  for  this  report 
will  be  an  industrial  complex  covering  100  acres  =  .1563  sq.  mi. 

It  will  be  assumed  that  if  an  object  tested  under  the  project 
under  consideration  impacts  within  this  100  acre  complex,  damage 
at  an  unacceptable  level  has  a  lOOjf  probability  of  occurlng. 

(Note  references  2  and  3,  or  Appendix  B  of  the  original  report, 
for  some  techniques  for  the  extent  of  damage  to  expect  if  an 
impact  occurs.)  Finally,  risks  will  be  computed  over  a  12  month 
period  of  time.  It  is  then  believed  that  the  risks  obtained  from 
a  study  such  as  this  can  easily  be  extended  to  other  types  of 
targets  and  for  different  periods  of  time. 

2.  Project  Classification; 

a.  Record  all  test  programs  that  are  presently  assigned  to  the 
installation  as  well  as  those  expected  to  be  assigned  within  the 
next  few  years*  Also,  review  some  of  the  programs  that  were 
previously  assigned  to  the  installation,  because  some  of  these 
might  provide  information  that  can  be  useful  In  the  evaluation 
of  present  or  future  systems. 


b.  Determine  which  projects  present  no  risk  to  targets  of  concern 
and  remove  them  from  further  study.  These  projects  may  involve 
testing  objects  with  Insufficient  range  to  reach  the  boundaries  of 
the  test  installation;  the  test  objects  may  be  of  such  material 
that  they  will  do  no  serious  damage  if  they  do  impact  in  a  critical 
area;  or  the  system  may  be  so  reliable  that  an  unplanned  impact 
i 8  virtually  impossible. 


100 


o.  Collect  the  following  information  for  all  of  the  remaining 
projeotsi 

(1 )  System  design  and  performance  characteristics. 

(2)  Sxpeoted  reliability  and  accuracy  of  the  system. 

(3)  Expected  number  of  future  tests. 

(■4)  Expert  opinion  on  the  reliability  and  performance 
oharacterisltcs  of  the  system. 

(5)  Mass,  shape,  penetration  capabilities,  and  other  des- 
tructlve  characteristics  of  the  test  object. 

(6)  If  test  data  exists,  collect  the  following! 

(a)  Total  number  of  tests  that  have  been  conducted. 

(b)  Number  of  rounds  that  have  malfunctioned,  reulting  in 
unplanned  Impacts. 

(c)  The  location  of  unplanned  impacts. 

d.  Order  the  projects  on  the  basis  of  the  amount  of  test  data 
that  is  available.  Analyze  those  projects  with  a  good  deal  of 
data  first,  because  the  results  from  these  projeots  may  be 
useful  when  evaluating  those  projects  with  little  or  no  test  data. 

3«  Constructing  a  Footprint! 

a.  A  footprint  must  be  constructed  for  each  project  which  can 
provide  a  threat  to  any  target  under  consideration.  This  foot¬ 
print,  when  constructed,  will  be  superimposed  upon  figure  one, 
as  illustrated  by  figure  2.  (see  page  7).  It  should  be  pointed 
out  that  some  projects  will  require  more  than  one  footprint,  one 
for  each  test  configuration. 

b.  The  footprint  that  is  being  used  to  l.llutrate  these  procedures 
consists  of  a  set  of  conoentric  ellipses.  The  procedures  used 

to  construct  this  footprint  are  discussed  in  detail  in  appendices 
A,  Ci  and  D  of  the  original  report.  Examples  of  other  types  of 
footprints  are  discussed  in  Appendix  B  of  the  original  report. 

The  footprint  used  in  this  study  may  be  desireable  if  it  can  be 
determined  that! 

(1)  The  unplanned  impacts  appear  to  be  distributed  approximately 
as  the  bivariate  normal. 

(2)  The  center  of  impact,  the  angle  of  rotation,  and  the 
variances  oan  all  be  estimated  by  one  procedure  or  another. 

c.  It  is  assumed  that  in  this  illustrated  example,  there  exists 
a  considerable  amount  of  test  data  which  can  be  used  to  estimate 
the  required  parameters.  The  coordinates  of  the  unplanned  impacts 
are  listed  in  table  1  (Page  6).  There  will  be  many  instances  in 
which  it  will  be  necessary  to  estimate  these  parameters  by  methods 
that  do  not  depend  upon  test  data. 

d*  The  projeot  used  to  illustrate  these  procedures  will  be 
referred  to  as  Project  A.  The  following  information  will  be 
used  to  construct  tnis  footprint,  in  addition  to  the  information 
listed  in  paragraph  le; 
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(1)  240  relevant  tests  have  been  conducted  under  project  a.  Of 
this  number,  18  rounds  were  unreliable  and  unplanned  impacts 
resulted,  impaot  data  for  the  16  unplanned  impacts  may  be 
found  in  columns  2  and  3  of  table  1. 

(2)  it  has  been  estimated  that  there  will  be  about  32  tests  each 
year  for  several  years  to  come.  Thus*  we  oan  expect  about 
2.40  unplanned  impacts  per.  year. 

(3)  Assume  that  for  this  type  of  test,  a  flight  surveillance 
system  has  a  90*  capability  of  destroying  or  diverting 
malfunctioning  rounds  so  that  no  damage  will  occur  to  a 
target. 

e.  The  x  and  y  coordinates  of  the  18  data  points  (see  col.  2  4  3 
of  table  1)  must  be  transformed  as  follows  before  constructing 
the  footprint* 


(1)  The  (x,y)  coordinate*-  must  be  translated  to^provide  an 
(x»,y*)  coordinate  system  such  that  x'  and  y»  equal  zero. 
The  (x*,y*)  coordinates  are  listed  in  columns  4  4  5  of 
table  1*  and  the  values  of  x^y*  are  listed  in  column  6. 

(2)  a  rotation  of  axes«  providing  {x”,ym)  coordinates,  is 

necessary  so  that  will  equal  zero.  The  rotation 

formulas  arei  x"  *  x*«cos  9  +  y*«sin«  and 

y"  =  -x'»sin  9  ♦  y*»oos  9 
Where  9  may  be  obtained  as  follows* 


tan2«  = 


2  S  x*»y* 

•  z  y 


The  derivation  of  the  above  formulas  may  be  found  in 
appendix  C  of  the  original  report. 


The  (x^y")  coordinates  for  the  18  unplanned  impact  points 
may  be  found  in  columns  7  and  8  of  table  1,  page  6.  The 
values  of  x^y"  ar£  listed  in  column  8.  It  may  be  seen 
from  table  1  that  x"  *  y"  *  rxmym  *  0,  which  is  exactly 
what  the  translation  and  rotation  was  expected  to 
accomplish.  Using  (x^y*)  data  from  table  1,  9  *  37.7°. 


f.  The  footprint  will  now  be  constructed  and  superimposed  upon 
figure  one,  usin*  the  following  procedures.  This  will  be 
illustrated  by  figure  two. 


(1)  The  footprint  will  consist  of  9  arbitrarily  chosen, 

concentric  ellipses.  These  ellipses  will  be  constructed 
to  contain  20*,  40*,  60*,  80*,  90*,  95*.  99*,  99.5*,  and 
99.9*  of  the  expected  unplanned  impacts.  If  the  number  of 
ellipses  1 8  Increased,  it  should  result  in  improving  the 
precision  of  the  estimates  of  risk.  It  will,  however, 
increase  the  labor  required  to  obtain  the  estimates. 
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12) 


2  2 

The  9  ellipses  will  be  of  eha  forat  iZ -  *  i*  ■  X , 

a2  b2 _ 

since  rx«y«  «  0.  a  -  k*sx«(  b  »  k»ey«;  k  «=  •y'-ii'inu  - 
The  foraule  for  k  is  developed  from  the  x*  distribution 
with  2  degrees  of  freedom,  in  appendix  D  or  the  original 
report. 


O)  Table  1,  showing  the  lx,j)j  end  lxn,y*)  coordinates 

for  the  18  unplanned  impact  points  for  Project  A.  Is  shown 
below. 


U)  (2)  (3)  (4)  (5)  (6)  ,  _  Hi _ _J2i 
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-»-t/)l  JieSyMl^nd  (x‘ijr* )  Coordinates  of  16  Unplanned 
lapect  Points. 


(b)  figure  2,  shows  the  footprint  for  Project  A. 

superimposed  upon  the  area  map  of  figure  one. 
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4.  Assignment  of  Probabilities  to  Reference  Polnto. 


a*  Reference  point  8-14  will  be  used  as  an  example  to  illustrate 
how  risks  are  assigned*  Note  that  point  E-14  is  located  within 
the  band  which  ia  bound  by  the  *95  end  *99  ellipses*  (see  the 
footprint*  fig.  2.) 


b*  Begin  by  computing  the  probability  that  over  a  period  of  12 
months*  a  test  object  from  Project  A  will  impact  within  a  target 
as  described  in  para,  le*  and  do  damage  at  an  unacceptable  level* 

The  computation  will  be  for  a  target  lying  within  the  band  bound 
by  the  *95  and  *99  ellipses*  the  band  which  oontains  the  reference 
point  E-14.  Therefore*  the  probabilities  will  apply  to  such  a 
target  lying  at  or  near  point  E-l4. 

(1)  The  risk  from  a  single  test  assigned  to  Project  A  is  as 

f oil  cm  si 

R  «  .1563  *  1*00  x  18/240  x  .10  x  .04/582.674  «= 

8.04?4  x  10“®,  where  1 

(a)  .1563  sq.  ml.  a  100  acres*  which  is  the  size  of  the 
Industrial  complex  under  consideration* 

(b)  1.00  is  the  probability  that  damage  at  an  unacceptable 
level  will  occur  if  there  is  an  Impact  within  the 
Industrial  complex.  (note  appendix  E  of  the  original 
report  for  further  discussion). 

(c)  18/240  =  .0750  comes  from  18  unplanned  Impacts 
(unreliable  rounds)  from  a  total  of  24o  tests. 

(d)  .10  is  the  estimate  of  the  probability  that  the  flight 
surveillance  system  will  fall  to  destroy  or  divert  the 
unreliable  round  in  such  a  way  as  to  avert  an  unacceptable 
level  of  damage.  Assume  this  was  based  upon  380  attempts 
in  which  33  were  not  successful. 

(e)  .04  is  the  probability  of  falling  in  the  band  that  is 
bound  by  the  .95  and  .99  ellipses. 

(f)  582.674  sq.  mi.  is  the  area  of  this  band. 


(2)  Since  it  is  estimated  that  there  will  be  32  tests  per.  year 
under  Project  A*  the  risk  to  the  target  from  Project  A,  over 
a  12  month  period  is  as  follows t 


P(Rl8k8  for  32  tests)  »  1  -  (1  -  R)^2.  However,  for  small 

P  »  32 *R  »  32(8.0474  xlO"8)  =  2.5752  x  10*6 

Also,  for  small  R*  this  can  be  extended  to  any  number  of 

years  by  multiplication.  For  example,  for  25  years* 

P  »  25(2.5752  x  10“6)a  6.438  x  10*5. 

Note  that  2.5752  x  10"°  is  the  risk  assigned  to  the  band 
bound  by  the  ,95  and  *99  ellipses.  Risks  from  Project  A 
to  the  other  bands  are  computed  by  a  similar  method.  (Note 


R 


» 


the  footprint  on  figure  2.) 
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c.  Finally,  the  risks  will  be  computed  for  a  target  ac  described 
in  para*  Je»  located  at  point  E-l4,  from  all  projects  located  at 
the  Installation,  and  for  a  period  of  12  months* 

(1)  The  first  step  is  to  review  all  footprints  from  all  projects 
and  observe  which  contain  point  E-l4* 

(2)  Then,  using  the  methods  of  para*  4b, determine 

risks  for  point  E-14,  from  all  relevant  footprints*  These 
risks  will  be  added  together  to  give  the  overall  risk  from 
the  entire  military  installation'.  It  was  observed  that 
there  were  10  footprints  that  covered  the  point  E-l4,  and 
the  risks  associated  with  each  is  listed  below* 

PROJECT 

A 

B 
C 

D 
E 


(3)  From  these  results,  it  can  be  seen  that  the  probability 
is  about  one  in  100,000  that  during  any  12  month  period 
an  object  from  the  Installation  may  impact  at  the  target 
site,  located  near  point  (E,l4),  and  do  damage  at  an 
unacceptable  level* 

5*  Constructing  the  Reference  Point  and  Contour  haps 

a.  By  using  the  procedures  of  section  5,  it  is  possible  to  assign  a 
probability  to  every  reference  point  on  the  map,  and  it  is  quite 
possible  that  a  map  with  the  level  of  risk  recorded  at  each  reference 
point  is  all  that  a  user  will  want*  Such  a  map  is  illustrated  by 
figure  3. 

b.  If  a  contour  map  is  desired,  proceed  as  follows! 

(1)  Determine  the  probabilities  desired  to  assign  to  each 
contour  line. 

(2)  Locate  the  adjacent  reference  points,  with  probabilities 
just  greater  than  and  just  less  than  that  of  the  contour 
lines  being  considered* 

(3)  By  appropriate  interpolation  (probably  logarithmic),  determine 
where  the  contour  lines  should  lie  between  these  ref*  points* 

(4)  Connect  these  points  by  ruler  and  french  curve  and  thus 
construct  the  contour  lines. 

c*  Figure  3  illustrates  how  the  reference  points  can  be  labeled, 
showing  the  risks  of  lmoact  and  serious  damage  at  each  of  these 
points.  Then,  using  the  reference  points  as  a  guide,  a  contour 
map  has  been  superimposed  upon  the  map  of  the  region.  If  the 
labeled  reference  points  appear  to  be  most  useful,  it  will  be 
unnecessary  to  construct  the  contour  lines. 
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risks  for  each  contour. 
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6.  Confidence  Limits. 

A  great  deal  of  additional  study  is  needed  to  develop 
improved  procedures  for  determining  confidence  limits  for  risks 
at  desired  points  throughout  the  region  surrounding  the  military 
installation.  Pages  9t  11*  and  12  of  the  original  report  discuss 
how  a  modification  of  the  procedures  of  "propagation  of  errors" 
may  be  used  to  obtain  estimates  of  the  upper  confidence  limits 
for  the  risks.  References  4,  5,  and  6  discuss  the  methods  of 
propagation  of  errors.  It  is  hoped,  however,  that  better  methods 
than  propagation  of  errors  will  someday  be  found* 


7*  How  3afe  is  Safe  Bnough? 

a.  When  it  is  indicated  from  a  study  that  the  risk  at  some  point 
is  some  figure  (one  in  100,000  for  example),  the  user  is  likely 
to  ask  several  questions  about  this  figure,  including* 

(1)  What  does  this  level  of  risk  mean? 

(2)  Is  this  level  safe  enough? 

(3)  Why  should  we  accept  any  risk? 

b.  The  following  suggestions  may  help  in  answering  some  of  the 
questions  of  the  users. 

(1)  Risks  from  a  military  installation  may  be  compared  to 
risks  that  occur  in  every  day  life.  To  make  these 
comparisons,  such  publications  as  "Acoldent  Pacts" 

(see  reference  12)  may  provide  some  useiul  information. 
For  example,  about  one  out  of  every  400o  Americans  will 
die  as  a  result  of  a  motor  vehicle  accident  within  the 
next  12  months,  and  one  of  every  100  Americans  can  expect 
to  receive  a  disabling  Injury  from  the  same  source. 

Thus,  if  it  can  be  shown  that  the  operations  from  the 
installation  are  very  much  less  than  the  risks  from  the 
dally  use  of  the  American  automobile,  this  may  be 
convincing. 

(2)  Perhaps  risks  c«n  be  reduced  to  economic  terms.  For 
example,  if  a  plant  is  worth  10  million  dollars,  and 
the  annual  risk  is  one  in  100,000.  This  would  indicate 
a  risk  of  only  $100  per.  year.  Thus,  insurance  may  be 
available  to  cover  such  a  risk. 

(3)  Finally,  when  users  are  reluctant  to  accept  any  risk,  it 
should  be  pointed  out  that  some  risk  is  associated  with 
every  activity  carried  on  by  the  human  race.  Thus,  it 
is  necessary  to  find  ways  to  reduce  the  risk  of  all 
activities  to  an  acceptable  level. 
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THE  1980  SAMUEL  S.  WILKS  MEMORIAL  MEDAL 
Frank  E.  Grubbs 

The  Samuel  S.  Wilks  Memorial  Medal  Award  was  initiated  in 
1964  by  the  US  Army  and  the  American  Statistical  Association, 
and  has  been  administered  for  the  Army  by  the  American  Statistical 
Association,  a  non-profit,  educational  and  scientific  society 
founded  140  years  ago  in  11839.  The  Wilks  Medal  and  Award  is 
given  each  year  to  a  statistician  -  and  a  top-nc  one!  -  and 
is  based  primarily  on  his  contributions  to  the  vancemenf.  of 
scientific  or  technical  knowledge  in  Army  stai  tics,  ingenious 
application  of  such  knowledge,  or  successful  t  tivity  in  the 
fostering  of  cooperative  scientific  matters  which  coincidentally 
benefit  the  Army,  the  Department  of  Defense,  the  US  Government, 
and  our  country  generally.  The  Award  consists  of  a  medal,  with 
a  profile  of  Professor  Wilks  and  the  name  of  the  Award  on  one 
side,  the  seal  of  the  American  Statistical  Association  and  the 
name  of  the  recipient  on  the  reverse  side,  and  a  citation  and 
honorarium  related  to  the,  magnitude  of  the  Award  funds,  which 
were  generously  donated  by  Phillip  G.  Rust  of  the  Winnstead 
Plantation,  Thomasville,  Georgia.  Mr.  Rust  originally  stimulated 
the  interest  of  Sam  Wilks  in  distributional  properties  of  the 
"extreme  spread"  (bivariate  range),  a  measure  of  the  "accuracy" 
of  rifle  shot  on  a  target. 

These  annual  Aroy  Design  of  Experiments  Conference.,  at  which 
the  Wilks  Medal  is  awarded  each  year,  are  sponsored  by  the  Army 
Mathematics  Steering  Committee  on  behalf  of  the  Office  of  the 
Chief  of  Research,  Development  and  Acquisition,  Department  of  the 
Army. 
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BIOGRAPHY  OF  THE  RECIPIENT  OF  THE  1980  SAMUEL  S.  WILKS  MEMORIAL  MEDAL 

by 

Churchill  Elsenhart 


The  1980  Samuel  S.  Wilks  Memorial  Medalist  Is  an  Internationally  recognized 
authority  on  statistics  whose  leadership  has  contributed  greatly  to  the  adoption, 
acceptance,  and  effective  use  of  statistical  thinking  and  statistical  methods  In 
many  areas  of  research  and  human  affairs,  In  both  the  governmental  and  private 
sectors . 

He  was  born  In  Philadelphia,  Pennsylvania,  on  November  5,  1912.  When  three 
years  old,  his  family  moved  to  the  West  Coast  and  lived  successively  In  Berkeley, 
Fresno,  and  Los  Angeles,  California,  and  Portland,  Oregon,  while  his  father, 
primarily  a  physical  anthropologist  trained  at  Oxford  under  a  Rhodes  Scholarship, 
with  Ph.O.  In  Philosophy  from  the  University  of  Pennsylvania,  taught  at  the 
University  of  California  at  Fresno  State  College,  served  with  the  California 
Commission  on  Emmlgratlon  and  Housing,  and  taught  at  Reed  College. 

At  the  age  of  9,  he  had  a  newspaper  route  In  Portland  and  wen  a  Thanksgiving 
turkey  for  an  unusually  large  Increase  In  circulation.  Recalling  this  at  the 
time  (1962)  of  his  appointment  as  President  of  the  University  of  Rochester,  he 
said:  "I  remember  this  partly  because  It  was  my  first  lesslon  In  pitfalls 
of  statistical  measurement.  The  base  set  for  measuring  the  growth  of  my  route 
was  the  month  of  August,  and  It  was  no  feat  at  all  to  triple  circulation  when 
the  Reed  College  faculty  returned  from  their  vacations  and  especially  when 
hundreds  of  students  arrived  at  the  college.  Most  of  all,  I  remember  that  turkey 
because  my  father  and  I  received  It— alive— In  downtown  Portland  and  took  It 
home  by  streetcar." 

The  family  moved  to  Minneapolis,  Minnesota  In  1923  when  his  father  joined 
the  faculty  of  the  Department  of  Anthropology  at  the  University  of  Minnesota. 

As  a  boy  In  Minneapolis,  our  Wilks  Medalist  became  an  ardent  stamp  collector, 
tennis  player  and  photographer.  He  organized  a  small  stamp  company,  selling 
stamps  partly  by  mall  but  mostly  to  boys  In  the  neighborhood.  One  of  his  best 
customers  was  Richard  M.  Scammon,  who  was  later  to  become  Director  (1961-1965) 
of  the  U.  S.  Bureau  of  the  Census  and  to  serve  with  our  Medalist  on  the 
President's  Commission  on  Federal  Statistics. 

Our  1980  Wilks  Medalist  entered  the  University  of  Minnesota  In  the  fall 
of  1928;  majored  In  psychology  with  a  minor  In  sociology,  took  nearly  as  much 
work  In  mathematics  and  In  philosophy,  and  gained  valuable  writing  experience 
as  an  editorial  writer  for  the  college  paper,  the  Minnesota  Dally,  then  known 
as  "the  world's  largest  college  dally".  His  high  scholarship  lea  to  his 
election  to  Phi  Beta  Kappa,  and  to  receipt  of  his  A.B.  degree  magna  cum  laude 
In  1932  at  the  age  of  19.  A  paper  on  "The  Influence  of  Color  on  Apparent 
Size"  which  he  wrote  during  his  junior  year  was  published  In  the  Journal  of 
General  Psychology,  Vol.  13  (1935)  and  has  been  reprinted  In  books  of  readings 
In  psychology. 
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Shortly  before  graduation  he  decided  on  a  career  In  economics,  with 
emphasis  on  mathematical  economics  and  statistics.  He  remained  at  the 
University  of  Minnesota  for  the  academic  year  1932-1933  to  continue  his  study 
of  mathematics  and  to  study  economics  and  then  moved  on  to  the  University 
of  Chicago,  where  he  held  a  University  Fellowship  In  the  Department  of  Economics 
from  1933  to  1935.  It  was  at  Chicago  that  he  began  life-long  friendships 
with  Milton  Friedman  and  George  J.  Stlgler  (father  of  the  present  Theory  and 
Methods  Editor  of  the  Journal  of  the  American  Statistical  Association).  They 
(and  two  other  fellow  students)  selected,  arranged,  and  saw  through  to 
publication,  the  first  book  of  Professor  Frank  H.  Knight's  essays.  The  Ethics 
of  Competition  (Harper  &  Brothers,  1935)— an  early  Instance  of  our  Medalist's 
drive  to  see  worthwhile  material  formally  published  In  the  open  literature. 

He  spent  the  academic  year  1935-1936  as  Granville  W.  Garth  Fellow  In 
Political  Economy  at  Columbia  University,  where  he  studied  with,  among  others, 
Wesley  C.  Mitchell,  one  of  the  most  eminent  of  American  economists,  doyen  of 
American  business  cycle  analysts,  the  1918  President  of  the  American  Statistical 
Association,  co-founder  (1920)  and  director  of  research  of  the  National  Bureau 
of  Economic  Research  (an  Independent  non-profit  organization),  etc.,  and 
especially  with  Harold  Hotelling,  a  pioneer  In  mathematical  economics, 
econometrics,  and  multivariate  statistical  analysis,  and  the  person  In  the 
United  States  then  most  versed  In  R.  A.  Fisher's  theory  of  small  samples  and 
statistical  Inference.  Furthermore,  this  was  the  year  In  which  Hotelling 
revealed  the  potential  of  systematic  treatment  of  functions  of  the  relative 
ranks  of  sample  observations  as  a  basis  for  what  are  now  termed  "distribution- 
free"  or  "non-parametrlc"  methods  of  statistical  Inference,  through  his  joint 
paper  with  Margaret  Richards  Pabst,  "Rank  correlation  and  tests  of  significance 
Involving  no  assumption  of  normality".  Presented  at  the  New  York  meeting  of 
the  American  Mathematical  Society  on  October  26,  1935,  and  published  In  the 
March  1936  Issue  of  the  Annals  of  Mathematical  Statistics  (Vol .  7,  29-43), 
this  paper  marked  "the  true  beginning''  of  research  on  such  methods  as  "an 
Important  special  field  of  statistics"  (I.  Richard  Savage,  JASA,  Vol.  58, 
p.  844,  December  1953).  Hotelling  himself,  his  teaching,  and"  h"1  s  research 
exerted  a  far  reaching  influence  on  our  1980  Wilks  Medalist's  career  In 
statistics  as  will  become  evident  as  we  proceed.  For  the  moment  we  may 
note  simply  that  our  Medalist's  first  statistical  paper,  "The  Poisson  Distribution 
and  thy  Supreme  Court",  published  In  the  June  1936  Issue  of  JASA  (Vol.  31,  376-380) 
was  written  as  a  course  paper  for  Hotelling. 

Ourlng  the  summer  of  1935  and  the  academic  year  1936-37,  our  1980  Wilks 
Medalist  was  an  economist  for  the  National  Resources  Committee,  a  New  Deal 
agency  In  Washington,  D.C.  Milton  Friedman  was  there  also,  and  while  there, 
wrote  his  paper,  "The  use  of  ranks  to  avoid  the  assumption  of  normality 
Implicit  In  the  analysis  of  variance"  (JASA  32,  675-701,  December  1937)  in 
which  he  thanks  our  Medalist  for  bringing  to  his  attention  a  more  Informative 
method  of  handling  tied  ranks.  While  serving  with  this  Committee,  our  Medalist 
co-authored  a  book  on  estimates  of  consumer  expenditures  In  the  United  States 
for  1935-1936,  and  worked  on  an  article  on  the  temporal  stability  of  consumption 
that  saw  publication  In  1942. 
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His  first  teaching  appointment  was  as  Instructor  In  Political  Economy  et 
Yale  University  in  1937-38.  "The  high-point  of  that  year,"  he  has  said,  "was 
getting  to  know  Irving  Fisher  who  was  especially  hospitable  because  of  my 
Interest  In  mathematical  economics,  a  field  In  which  he  had  pioneered  forty 
years  earlier." 

In  the  fall  of  1938  our  Wilks  Medalist  joined  the  Department  of  Economics 
at  Stanford  University,  an  association  that  continued,  with  extensive  Interruptions, 
until  1946.  At  Stanford  he  taught  courses  in  economic  theory,  mathematical 
economics,  and  advanced  statistics.  His  first  contribution  to  statistical 
methodology,  "The  correlation  ratio  for  ranked  data",  published  In  the  September 
1939  Issue  of  JASA  (Vol.  34,  533-538),  grew  out  of  consulting  he  did  with  a 
psychologist  durTng  his  first  year  at  Stanford,  and  anticipated,  or  perhaps  we 
should  say  paralleled.  Independent  work  by  M.  G.  Kendall  and  B.  Bablngton  Smith, 
published  In  the  September  1938  Issue  of  the  Annals  of  Mathematical  Statistics 
(Vol.  10,  No.  3,  275-287— our  Medalist's  "rank  correlation  rat1oh,  nf,  Is  exactly 

their  "coefficient  of  concordance",  W.  A  dozen  years  later,  at  the  University 
of  Chicago,  he  returned  to  consideration  of  methods  of  analysis  of  ranked  data 
as  means  of  avoiding  the  Implications  of  the  normality  assumption  underlying 
many  coirmon  statistical  tests,  and  with  William  H.  Kruskal  prepared  a  compre¬ 
hensive  treatment  of  the  "Use  of  ranks  In  one-criterion  variance  analysis", 
published  In  the  December  1952  Issue  of  JASA  (Vol.  47,  583-621),  In  which  they 
Introduced  their  now  widely  used  H  test,  thus  designated  In  honor  of  Hotelling. 

During  1939-40  and  the  last  half  of  1941,  our  1980  Wilks  Medalist  was  a 
Carnegie  Research  Associate  at  the  National  Bureau  of  Economic  Research  (NBER) 

In  New  York  City,  on  leave  of  absence  from  Stanford;  and  took  advantage  of  the 
proximity  of  Columbia  University  to  attend  the  lectures  there  of  Abraham  Wald, 
newly  arrived  (1939)  from  Austria.  At  the  NBER  he  was  closely  associated  with 
Arthur  F.  Burns  (later  Chairman,  President's  Council  of  Economic  Advisors; 

Chairman,  Board  of  Governors  of  the  Federal  Reserve  System,  etc.),  Wesley  C. 

Mitchell  (mentioned  above),  Frederick  C.  Mills  (1934  President  of  the  ASA; 
and  author  of  a  statistical  methods  text,  the  second  edition  of  which  In  1938 
Incorporated  many  of  the  new  Ideas  and  methods  of  R.  A.  Fisher  and  was  used 
widely  by  students  In  economics,  business  and  other  fields),  and  Geoffrey  H. 

Moore  (who  later  became  the  1968  President  of  the  American  Statistical 
Association  and  Commissioner  1969-1973,  of  Labor  Statistics,  U.  S.  Department 
of  Labor.)  Analysis  and  Interpretation  of  economic  time  series  occupied  center 
stage  at  the  NBER.  With  Moore  he  published  A  Test  of  Significance  for  Time 
Series  and  Other  Ordered  Observations  (National  Bureau  or  Economic  Research 
Technical  Paper  1,  September  1941,  59  pp)  in  which  they  developed  a  test  for 
randomness,  relative  to  either  a  monotonic  or  oscillatory  trend,  based  on 
the  distribution  of  length  of  runs  up  and  down;  and  two  articles  (with  Moore) 

In  JASA,  "A  significance  test  for  time  series"  (Vol.  36,  401-409,  September  1941), 
and  ’Time  series  significance  tests  based  on  signs  of  differences"  (Vol.  38, 
1953-1964,  June  1943).  The  first  provided  a  brief  summary  of  the  Technical 
Paper,  with  examples  of  the  application  of  the  test  developed  therein;  the 
second,  an  alternative  but  not  Independent  test  based  on  the  total  number  of 
runs  up  and  down.  These  two  tests  are  today  standard  tools  of  nonparametrlc 
statistics. 
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At  NBER,  he  also  carried  out  much  of  the  research  embodied  In  his  paper 
"Compounding  probabilities  from  Independent  significant  tests",  published  In 
the  July-Oct.  1942  Issue  of  Econometrics  (Vol.  10,  Nos.  3&4,  229-248),  In  which 
he  gave  a  clear  mathematical  exposition  of  the  basis  of  R.  A.  Fisher's  procedure 
for  combining  "significance  probabilities"  yielded  by  Independent  statistical 
tests  having  continuous  probability  distributions  (Fisher,  Statistical  Methods 
for  Research  Workers,  Fourth  Edition  (1932),  Sec.  21.1) — the  basis  of  which  was 
a  mystery  to  many  individuals  and  Incorrectly  explained  by  others— and  provided 
the  requisite  mathematical  extension  to  cases  In  which  at  least  one  of  the 
"significance  probabilities"  Is  obtained  from  a  statistical  test  having  a  discrete 
probability  distribution  such  as,  for  example,  a  rank  or  run  test.  At  that  time, 
too,  he  was  co-author  with  Milton  Friedman  of  a  paper  on  the  empirical  derivation 
of  Indifference  functions  which  saw  publication  In  1942. 

Our  1980  Wilks  Medalist  returned  to  Stanford  University  for  the  first  half 
of  1942.  The  United  States  was  then  at  war  with  both  Germany  and  Japan,  so 
rationing  and  price  control  were  matters  of  paramount  concern.  Our  Medalist 
responded  by  writing  a  paper,  "How  to  ration  consumers'  goods  and  control  their 
prices",  published  In  the  American  Economic  Review  later  that  year.  Then, 
on  April  17,  1942  our  1980  WTTks  Medalist  wrote  to  W.  Edwards  Demi  no  (then 
Head  Mathematician,  Mathematical  Advisor,  U.S.  Bureau  of  the  Census)  stating 
that  he  and  several  of  the  others  teaching  statistics  In  various  departments 
of  Stanford  considered  It  "probable  that  a  good  many  students  with  research 
training  might  by  training  In  statistics  become  more  useful  than  In  their  present 
work,  or  might  Increase  their  usefulness  within  their  present  fields"  and  asked 
for  Oemlng's  advice  on  the  development  of  "a  curriculum  adapted  to  the  Immediate 
statistical  requirements  of  the  war".  Oemlng  responded  by  April  24,  1942,  on  the 
letterhead  of  the  Chief  of  Ordnance,  War  Department,  suggesting  a  concentrated 
effort— a  "short"  course  followed  by  a  "long"  course  on  Shewhart  methods  of 
quality  control,  the  short  to  be  "for  executive  and  Industrial  people  who  want 
to  find  out  some  of  the  main  principles  and  advantages  of  a  statistical  program 
In  Industry";  the  long  course  for  "people  who  actually  Intend  to  use  statistical 
methods  on  the  job;  "both  courses  [to]  be  thrown  open  to  engineers,  Inspectors, 
and  Industrial  people  with  or  without  mathematical  or  statistical  training". 
(Portions  of  both  letters  are  reproduced  on  pages  320-321  of  the  June  1980 
issue  of  JASA . )  In  one  of  the  paragraphs  not  reproduced,  Demlng  points  out 
the  relevance  of  Wallis  and  Moore's  work  to  statistical  quality  control,  adding: 
"The  theory  of  runs  and  patterns  is  destined  to  receive  a  great  deal  of  attention 
from  now  on,  and  It  Is  a  pleasure  to  see  the  superb  effort  that  you  and  Mr.  Moore 
have  put  forth." 

The  Impact  of  Demlng's  suggestions  was  such  that  by  May  1st,  Holbrook 
Working  (Statistician  and  Economist  at  the  Stanford  Food  Research  Institute, 
and  Chairman  of  the  University  Committee  on  Statistics)  had  arranged  a  general 
meeting  of  everyone  In  statistics;  a  first  letter  about  the  course  went  out  on 
May  21  to  firms  In  the  Western  states  that  were  supplying  Army  ordnance;  and  In 
July  1942  the  first  course  was  given  at  Stanford,  by  Working  and  Eugene  L.  Grant 
(of  the  Engineering  School.)  Our  Medalist  had  been  scheduled  to  teach  this  course 
(with  Grant),  and  had  been  "beginning  to  wonder  how  to  learn  what  [he]  was  supposed 
to  teach",  when  he  was  asked  "to  head  up  an  economic  research  unit  In  the  Office 
of  Price  Administration"  In  Washington.  So  he  dropped  out  and  was  replaced  by 
Working.  The  course  was  such  a  success  that  early  In  1943  Working  was  chosen 
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to  head  the  now  famous  major  national  program  that  put  on  intensive  8-day 
courses  in  statistical  quality  control  throughout  the  country,  under  the  auspices 
of  the  Office  of  Production  Research  and  Development  of  the  United  States  Office  of 
Education.  By  March  1945  these  had  been  attended  by  more  than  1900  persons 
from  678  Industrial  concerns  in  the  United  States  and  13  In  Canada.  Many 
of  the  "students"  In  the  earlier  of  these  courses  went  out  to  serve  as 
"Instructors"  In  part-time  courses  that  brought  the  message  to  an  additional 
3,100  persons  In  American  and  Canadian  Industry.  This  program  had  an  enormously 
beneficial  effect  on  the  quality  and  volume  of  American  and  Canadian  war 
production;  and  "prepared  the  soil"  for  the  establishment  of  the  American 
Society  for  Quality  Control,  In  February  1946.  That  our  1980  Wilks  Medalist 
played  a  role  in  Initiating,  and  came  so  close  to  participating  In  this  very 
effective  venture  was  a  secret  well  kept  from  many  of  us  until  we  saw  mention 
of  It  In  the  June  1980  Issue  of  JASA. 

Our  1980  Wilks  Medalist  never  made  It  to  the  OPA  position  In  Washington. 

Before  his  appointment  to  that  position  became  official,  he  received  a  telegram 
from  Warren  Weaver,  Director  (Natural  Sciences)  of  the  Rockefeller  Foundation 
(1932-1955)  then  up  to  his  ears  In  support  of  the  war  effort  as  Chairman  (1940-42) 
of  Section  D-2  of  the  National  Defense  Research  Committee  (NDRC)  of  the  Office 
of  Scientific  Research  and  Development  (OSRD).  Weaver,  whom  our  Medalist  has 
described  as  "one  of  the  most  remarkable,  admirable,  brilliant,  sagacious  and 
civilized  human  beings  on  the  American  scene  In  the  past  half-century"  had 
perceived  an  urgent  need  for  a  concentrated  effort  focused  on  resolution  of 
the  various  mathematical  and  statistical  problems  that  were  arising  In  the  several 
armed  services  and  suppliers  of  their  material,  and  especially  those  problems 
that  were  arising  more  or  less  simultaneously  In  different  places  with,  as  he 
put  It,  "the  same  verbs  but  different  nouns";  and  was  engaged  In  setting  up 
several  mathematical  and  statistical  groups  to  do  the  "spade  work"  of  the  soon 
to  be  established  Applied  Mathematics  Panel  of  the  NDRC,  of  which  he  was  to  be  the 
Chief  (1943-46),  and  Thornton  C.  Fry  (of  the  Bell  Telephone  Laboratories),  the 
Deputy  Chief.  Wilks  had  suggested  to  Weaver  the  establishment  of  a  statistical 
group  at  Columbia  with  Hotelling  as  Principal  Investigator,  and  Hotelling  had 
brought  our  1980  Wilks  Medalist  to  Weaver's  attention.  Thus  it  came  to  pass 
that  on  July  1,  1942  our  Medalist  assumed  his  first  administrative  post.  Director 
of  Research  of  the  Statistical  Research  Group  (SRG)  at  Columbia  University  In 
New  York  City. 

SRG  got  off  to  a  start  with  just  three  experienced  researchers  or  "principals" 
as  he  terms  them  In  his  article  "The  Statistical  Research  Group,  1942-45"  in 
June  1980  Issue  of  JASA  (Vol.  75,  320-330):  Hotelling,  our  Medalist,  and  Jacob 
Wolfowltz,  another  former  student  of  Hotelling.  Before  Its  dissolution  on 
September  30,  1945  the  number  of  "principals"  had  risen  to  17— or  to  18,  If 
Frederick  Mosteller  Is  Included,  who  though  actually  on  the  payroll  of  another 
group,  worked  closely  and  extensively  with  this  SRG  for  essentially  one  full 
year  and  co-authored  two  of  Its  books  and  co-edited  one  of  these.  (The  names 
of  all  18,  with  their  respective  lengths  of  service  with  SRG,  are  listed  on 
page  324  of  our  Medalist's  aformentloned  article.)  These  "principals"  were 
supported  at  one  time  or  another  by  about  60  others:  typists,  secretaries, 
a  switchboard  operator,  an  administrative  assistant,  a  librarian,  a  messenger, 
and  about  30  young  women,  mostly  mathematics  graduates  of  Hunter  or  Vassar,  who 
did  the  necessary  computing  under  the  direction  of  Albert  Bowker.  The  "principals" 


worked  on  problems  of  tactics,  equipment,  and  operations  for  the  Army,  Navy, 
the  Air  Force  (which  was  a  branch  of  the  Army  In  World  War  II),  and  other  units 
of  OSRD.  Many  of  these  activities  stemmed  from  AMP  studies  assigned  to  SRG, 
but  a  large  number  stemmed  from  consultation  with  and  Informal  assistance  to 
Army,  Navy,  or  NDRC  groups.  Sometimes  one  problem  would  lead  to  a  related 
problem  in  another  setting,  or  experience  with  a  particular  technique  would 
lead  to  another  application  of  the  technique  to  an  unrelated  problem. 

Our  Medalist  mentions  In  his  article  a  great  number  of  the  military  problems 
on  which  SRG  worked,  so  there  Is  no  need  to  give  such  details  here.  The  most 
famous,  and  probably  the  most  Influential  and  lasting  contribution  was,  of 
course  Abraham  Wald's  development  of  sequential  analysis,  full  Instructions 
and  tables  for  the  practical  application  of  which  saw  open  publication  In 
Sequential  Analysis  of  Statistical  Data:  Applications  (Columbia  University 
Press’/  1945);  and  the  theoretical  development.  In  Wald's  book  Sequential  Analysis 
(Wiley,  1947).  Our  Medalist  In  his  article  gives  two  accounts  of  the  history 
of  sequential  analysis,  one  written  In  April  1943,  soon  after  the  development; 
and  the  other  written  from  memory  In  March  1950,  when  the  1943  memorandum  could 
not  be  located.  Both  accounts  bring  out  clearly  the  essential  roles  of  our 
Medalist  and  Milton  Friedman  In  getting  the  development  "off  the  ground" 
after  they  had  recognized  Its  possibility  of  achievement. 

Although  SRG  was  formally  dissolved  on  September  30,  1945,  our  Medalist 
stayed  on  until  March  31,  1946  to  make  sure  that  some  of  SRG's  other  wartime 
contributions  achieved  open  publication  In  a  unified  form  creditable  to  both 
the  Individuals  concerned  and  SRG.  Although  he  listed  himself  alphabetically 
as  the  third  editor  of  Selected  Techniques  of  Statistical  Analysis  (McGraw-Hill, 
1947),  and  alphabetically  as  the  fourth  of  the  editors  of  Sampling  Inspection 
(McGraw-Hill,  1948)  he  was  In  fact  the  Editor-In-Chief  for  both/ 

As  Director  of  Research  of  SRG,  our  1980  Wilks  Medalist  brought  together 
an  absolutely  extraordinary  group  of  research  workers  In  statistical  theory 
and  methodology.  In  both  number  and  quality.  The  experience  of  working  In  SRG 
contributed  significantly  to  the  subsequent  careers  of  a  substantial  number  of 
the  "principals".  Many  became  leaders  In  statistics  In  the  next  chree  decades. 
Four  became  President  of  the  American  Statistical  Association:  Bowker  (1964), 
our  Medalist  (1965),  Mosteller  (1967),  and  Elsenhart  (1971).  Seven  became 
President  of  the  Institute  of  Mathematical  Statistics:  Wald  (1948),  Glrshlck 
(1952),  L.  J.  Savage  (1958),  Wolfowltz  (1959),  Bowker  (1962),  Herbert  Solomon 
(1965)  and  Mosteller  (1975)— Hotelling  had  been  President  In  1941.  Mosteller 
Is  the  1980  President  of  the  American  Association  for  the  Advancement  of  Science. 
Friedman  received  the  Nobel  Prize  In  Economics  for  1976.  At  least  nine 
subsequently  became  chairmen  of  university  departments  of  statistics:  K.  A. 
Arnold,  Bowker,  Glrshlck,  Hotelling,  Mosteller,  Savage,  Solomon,  Wald,  and  our 
Medalist.  Two  became  heads  of  major  universities:  Bowker  (of  two:  City 
University  of  New  York  and  University  of  California  at  Berkeley),  and  our 
Medalist  (University  of  Rochester).  Three  received  the  Samuel  S.  Wilks  Medal: 
Solomon  (1975),  Elsenhart  (1977),  now  this  year's  Medalist. 
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The  Influence  of  SRG  continues  through  the  work  of  Its  "principals"  alive 
and  deceased  (Glrshlck,  Hotelling,  Savage,  Wald)  and  through  the  statistical 
tools  developed  at  SRG,  theoretical  and  practical,  which  have  become  established 
parts  of  statistics.  Several  effective  statistical  consulting  groups  have  since 
been  modeled  on  SRG,  notably  those  at  the  Bell  Telephone  Laboratories  and  at 
the  National  Bureau  of  Standards.  But  even  today  It  Is  probably  not  saying  too 
much  to  say  that  SRG  was  the  best  statistical  research  and  consulting  group  ever. 
Those  who  worked  there  know  this  to  be  the  consequence  of  the  high  standards  of 
excellence  established,  maintained,  and  Insisted  upon  by  its  Director  of  Research, 
our  1980  Wilks  Medalist. 

After  all  that,  whatever  more  Is  said  Is  bound  to  be  antlclimatlc,  but  needs 
to  be  said  nonetheless  to  round  out  the  record  and  give  the  full  picture  of 
this  champion  of  statistical  theory  and  methodology. 

In  the  Spring  of  1946  our  Medalist  returned  to  Stanford  University  as 
Associate  Professor  of  Economics,  and  immediately  Instituted  steps  toward  the 
establishment  of  a  Statistics  Department  there.  However,  before  that  department 
came  into  being,  he  had  left  In  the  Fall  of  1946  to  join  the  University  of 
Chicago  as  Professor  of  Statistics  and  Economics  in  the  Graduate  School  of 
Business.  (Later  he  was  also  named  a  Professor  In  the  Department  of  Economics, 

In  the  Division  of  Social  Sciences).  In  1949,  he  became  Chairman  of  the  newly 
formed  Department  of  Statistics  In  the  Division  of  the  Physical  Sciences,  a  post 
he  held  until  1957.  During  his  chairmanship,  the  Department  of  Statistics  at  the 
University  of  Chicago  became  one  of  the  outstanding  departments  In  Its  field  In 
the  world  a  position  that  It  still  retains.  (In  addition,  he  played  behind-the- 
scenes  roles  In  the  establishment  of  Department  of  Statistics  at  Columbia 
University,  Harvard  University,  and  the  University  of  Rochester,  making  a  total 
of  five  whose  establishment  he  Influenced  In  minor  to  major  ways.)  In  1956  our 
Wilks  Medalist  was  appointed  Dean  of  the  University  of  Chicago's  Graduate  School 
of  Business,  became  financially  self-supporting  while  tripling  Its  annual 
expenditures,  and  came  to  be  widely  recognized  as  one  of  the  Nation's  very  best. 

There  Is  an  amusing  side  to  our  Medalist's  appointment  to  his  first  tenured 
professorship  at  the  University  of  Chicago.  He  has  no  so-called  "earned  degrees" 
beyond  his  1932  A.B.  from  the  University  of  Minnesota.  He  had  satisfied  nearly 
all  the  requirements  for  a  Ph.D.,  some  at  the  University  of  Chicago,  others  at 
Columbia  University;  had  had  two  thesis  accepted;  but  before  he  had  completed  the 
remaining  requirements  at  the  University  of  Chicago,  he  was  appointed  a  professor 
with  permament  tenure  there  and  became  Ineligible  under  that  university's  rules 
to  receive  an  advanced  degree  from  It.  He  has,  however,  received  three  honorary 
degrees,  Doctor  of  Science  from  Hobart  and  William  Smith  Colleges  (1973),  Doctor 
of  Laws  from  Roberts  Wesleyan  College  (1973),  and  Doctor  of  Humane  Letters  from 
Grove  City  College  (1975). 

While  at  the  University  of  Chicago,  our  1980  Wilks  Medalist  published  a 
number  of  noteworthy  papers  on  statistical  methodology.  The  first  was  a  long 
paper,  "Standard  sampling-inspection  procedures",  presented  at  the  25th  Meeting 
of  the  International  Statistical  Institute  at  Washington,  D.C.,  in  1947  and 
published  subsequently  in  Its  Proceedings ,  Vol.  3,  331-350.  This  was  essentially 
an  exposition  of  the  basic  principles  and  state  of  the  art  of  acceptance  sampling 
procedures  as  spelled  out  In  more  detail  In  the  SRG  book,  Sampling  Inspection 
(1948),  then  In  press.  Next  was  a  basic  paper,  "Tolerance  Intervals  for  linear 
regression",  presented  at  the  Second  Berkeley  Symposium  on  Mathematical  Statistics 
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and  Probability  at  Berkeley,  California,  In  the  summer  of  1950,  and  published  In 
Its  Proceedings,  43-51.  "Rough-and-ready  statistical  tests",  published  In  the 
March  1952  issue  of  Industrial  Quality  Control  (Vol.  8,  No.  5,  35-40)  was  a 
composite  and  updated  version  of  some  notes  on  these  matters  made  available 
to  SRG  staff  during  WW  II,  updated  to  Include  some  of  the  material  to  be 
published  In  the  forthcoming  (1952)  joint  paper  with  W.  H.  Kruskal,  "Use  of 
ranks  In  one-criterion  analysis  of  variance",  mentioned  earlier.  With  Harry  V. 
Roberts  (an  Associate  Professor  of  Statistics  In  the  School  of  Business)  he 
co-authored  Statistics:  A  New  Approach  (The  Free  Press,  1956),  an  84-page 
work  that  became  a  widely  used  text  In  English  speaking  countries  and  saw 
translation  Into  German  (1959)  and  Portugese  (1964).  A  paperback  version 
of  the  first  quarter  was  Issued  by  Collier  Books  (1962)  under  the  title, 

The  Nature  of  Statistics,  and  has  been  translated  Into  Swedish,  Danish, 

Norwegian,  and  Japanese. 

In  addition,  while  at  Chicago,  our  1980  Wilks  Medalist  served  as  the 
Editor  of  the  Journal  of  the  American  Statistical  Association  for  nearly  a 
decade,  1950-19159.  During  1955  he  chaired  an  Inner-University  study  group 
formed  under  the  aegis  of  the  University  of  Chicago,  and  funded  by  the  Ford 
Foundation,  to  reach  a  decision  on  the  desirability  of  a  new  or  revised 
edition  of  the  Encyclopedia  of  the  Social  Sciences  (that  had  been  published 
in  15  volumes  by  trie  Macmillan  Company  between  1930  and  1935.  (The  study  group 
Included  members  from  the  University  of  California  at  Berkeley,  Harvard  University, 
University  of  Illinois,  Reed  College,  and  Princeton  Un1vers1ty--see  David  L. 

Sills,  "Editing  a  Scientific  Encyclopedia",  Science,  Vol.  163,  1169-1175,  14  March 
1969.)  The  project  layed  dormant  for  five  years,  until  late  1960,  when  the 
Macmillan  Company  decided  to  publish  a  new  encyclopedia  of  the  social  sciences. 

The  International  Encyclopedia  of  the  Social  Sciences  (1ESS),  which  saw 
publication  In  1>  volumes  by  the  Macmillan  Company  and  The  Free  Press  In  April 
1968.  Our  1980  Medalist  served  as  Chairman  of  the  Editorial  Advisory  Board, 
and  as  Chairman  of  the  Executive  Committee,  for  this  vast  undertaking.  Unlike 
its  predecessor  and  other  encyclopedias,  the  IESS  contains  a  great  many 
articles  on  statistical  concepts,  theory,  and  methodology,  together  with 
biographies  of  a  host  of  Individuals  who  made  significant  contributions  to 
statistical  thinking  and  practice,  excluding  those  still  alive  In  the  1960's. 
Consequently,  anyone  wishing  to  know  something  about  the  contemporary  state  of 
stat1st1cs--concepts,  theory  or  methodology--or  about  their  historical  development, 
found  this  encyclopedia  a  most  useful  source.  It  proved  so  useful  In  this  regard 
that  the  articles  on  statistics  and  articles  relevant  to  statistics  published  In 
the  IESS  were  brought  up  to  date  by  the  addition  of  Postscripts  or  by  revision 
In  whole  or  In  part,  and  republished  together  with  a  few  additional  articles  and 
biographies,  as  the  International  Encyclopedia  of  Statistics,  two  volumes,  by 
The  Free  Press  in  197F! 

Our  1980  Wilks  Medalist  became  President,  Professor  of  Economics  and 
Statistics,  and  Trustee  of  the  University  of  Rochester  In  1962.  His  title  was 
changed  to  Chancellor  In  1970.  In  1975,  In  anticipation  of  retirement,  he 
turned  over  the  chief  executive  responsibilities  but  remained  In  administration. 

In  1978,  he  retired  as  an  officer  of  the  university  but  continues  there  with 
the  same  title,  Chancellor. 


He  has  been  a  director  of  Bausch  &  Lomb,  Inc.,  since  1963,  Eastman  Kodak 
Company  since  1965,  Lincoln  First  Banks,  Inc.,  since  1967,  Macmillan,  Inc., 
since  1964,  Metropolitan  Life  since  1973,  Rochester  Telephone  Corporation 
since  1964,  Standard  Oil  Company  (Ohio)  since  1977,  and  Trans  Union  Corporation 
since  1962;  and  was  for  fourteen  years  a  director  of  Esmark,  Inc.,  (1963-1977). 

He  was  a  consultant  to  the  RAND  Corporation  from  1948  until  1966;  a  member 
(1952-1953)  of  an  advisory  panel  to  the  Secretary  of  Army  on  operations  research; 
a  member  (1969-1970)  of  the  President's  Commission  on  an  All  Volunteer  Armed 
Force;  Chairman  (1969-1978)  of  the  Commission  on  Presidential  Scholars,  and 
Chairman  (1970-1971)  of  the  President's  Commission  on  Federal  Statistics,  as 
well  as  a  member  of  chairman  of  various  other  Presidential  or  national  commissions 
and  councils.  The  present-day  Committee  on  National  Statistics  of  the  National 
Research  Council  was  established  on  the  recommendation  of  "his"  President's 
Commission  on  Federal  Statistics  to  grapple  with  and  help  resolve  conflicts 
over  statistical  aspects  of  such  national  problems  as  environmental  monitoring, 
presentation  of  statistical  evidence  In  court,  effect  of  changes  In  stratospheric 
ozone  on  Incidence  of  skin  cancer,  and  recently  the  1980  Census  undercount. 

Even  more  could  be  said  about  this  Wilks-like  Individual,  but  the  foregoing 
is  more  than  sufficient  to  explain  why  the  1980  Samuel  S.  Wilks  Memorial  Medal 
Is  awarded 


To  W.  Allen  Wallis  In  recognition  of  his  extraordinary 
contributions  to  the  effective  use  of  statistical  theory 
and  methodology  by  the  armed  services  during  World  War  II, 
for  his  outstanding  contributions  to  clear  statistical 
thinking  and  effective  statistical  practice  through 
the  publications  he  authored  or  edited,  for  his  leader¬ 
ship  of  statisticians,  and  for  his  service  to  the  nation 
through  chairmanship  of,  or  membership  on  numerous  high- 
level  Governmental  and  non-Governmental  commissions  and 
councils. 
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OPTIMAL  ESTIMATION  TECHNIQUES  FOR 
FORECASTING  PROPAGATION  PARAMETERS 

K.  £.  Kunkel  and  D.  L.  Walters 
Atmospheric  Sciences  Laboratory 
White  Sands  Missile  Range,  New  Mexico 


ABSTRACT 

The  prediction  of  stochastic  atmospheric  variables  such  as  wind 
and  temperature  on  time  scales  of  6  hours  down  to  a  few  seconds  is 
being  attempted.  The  statistical  problem  Is  to  find  the  optimal  esti¬ 
mation  technique  that  combines  the  existing  climatological  data  base 
with  current  measurements  to  provide  the  best  real  time  forecast. 

An  autoregressive  technique  has  been  attempted  but  yields  results 
which  reduce  the  error  by  only  10%  over  persistence  forecasts.  Panel 
recommendations  Indicate  that  mixed  autoregressive-moving  average 
techniques  and  Kalman  filter  techniques  are  the  most  promising  to 
attempt. 

I.  INTRODUCTION 

An  Important  goal  of  meteorology  is  to  accurately  predict  the 
state  of  the  atmosphere  at  some  time  In  the  future.  For  the  large 
scale  features  of  the  atmosphere,  this  can  be  done  in  a  quasi-determlnlstic 
(if  somewhat  Inaccurate)  sense  by  predicting  the  movement  of  large 
scale  weather  systems.  However  for  some  purposes,  such  as  predicting 
the  optical  propagation  characteristics  of  the  atmosphere,  It  is  neces¬ 
sary  to  know  the  small  scale,  or  turbulent,  nature  of  the  wind  and 
temperature  fields.  The  problem  that  we  wish  to  address  here  is  the 
prediction  of  stochastic  atmospheric  variables  for  time  scales  of  six 
hours  down  to  a  few  seconds. 

II.  DATA  DESCRIPTION 

An  example  of  the  kind  of  parameter  that  needs  to  bo  predicted 
is  the  temperature  near  the  surface  of  the  earth.  Fig.  1  shows  a  time 
series  of  temperature  for  a  15  min  period  at  heights  of  3  and  33m. 

This  figure  Illustrates  the  stochastic  nature  of  the  variable.  In 
general,  the  short  time  scale  variations  cannot  be  predicted  determin¬ 
istically.  However,  the  data  can  be  characterized  In  a  statistical 
sense.  The  following  are  typical  characteristics  of  the  data: 

1)  non-zero,  non-stationary  mean  _c/3 

2)  The  power  spectral  density  follows  a  k’D/J  behavior  (k  *  wave- 
number)  for  k  >  km1n  where  km<jn  is  some  wavenumber  scale  of  the  flow. 

For  k<  km1n>  the  power  spectral  density  has  no  definite  shape.  Fig.  2 

shows  the  power  spectral  density  for  temperature  at  3  and  33m  above 
ground.  The  k_5/3  behavior  is  exhibited  for  log  k  >  -1.5.  Assuming 
ergodlclty,  k  can  be  related  to  the  frequency  domain  by  k  =  2«f/U 
where  f  =  frequency  and  U  «*  mean  wind  speed. 
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time  (SECONDS! 


Figure  1.  Time  series  of  temperature  (°C)  at  heights  of  3m  (solid) 
and  T3m  (dotted). 


-3.  -2.  -1.  0-  >' 


LOG  OF  WAVENUMBER 


Figure  2.  The  log10  of  power  spectral  density  of  temperature  vs.  log 

of  wavenumber  at  heights  of  3m  (solid)  and  33m  (dotted). 
Units  of  wavenumber  |re  rad/m.  Dashed  line  show  a  power 
spectral  density  « dependence. 


3)  Low  frequency  trends  are  often  present.  These  are  usually 
tied  to  the  dally  cycle  of  heating  and  cooling  and  occur  at  frequencies 

In  the  range  10“*  -10’®  s’^. 

There  are  two  sources  of  data  available  that  can  be  used  as  the 
basis  for  making  a  prediction.  These  are: 

1)  Climatological  data  base.  These  data  provide  general  character 
Istlcs  of  the  data  In  the  past.  These  Include  low  frequency  trends 
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tied  to  the  daily  cycle  and  typical  expected  power  spec  ral  levels  as 
a  function  of  time  of  day  and  other  external  factors. 

2)  Measurements  taken  on  the  current  day. 

The  problem  then  becomes  to  predict  a  stochastic  variable  given  clima¬ 
tological  data  and  present  measurements. 

III.  RESULTS 

One  attempt  has  been  made  to  solve  this  problem  by  using  an  adap¬ 
tive  linear  prediction  filter  similar  to  one  described  by  Keeler  and 
Griffiths  (1977).  This  Is  an  autoregressive  approach  and  can  be  briefly 
summarized  as  follows.  If  x  is  the  variable  (with  mean  removed)  to  be 
predicted,  then  the  prediction  at  time  t,  xp(t),  Is  given  by 

I 

x  (t)  *  z  g,  x(t  -  iAt) 

P  m  1 

where  x(t  -  iAt)  are  measured  values,  At  Is  the  time  Interval,  and  g, 
are  weighting  coefficients.  The  prediction  error  E ( t )  -Is  given  by 

E(t)  ■  x(t)  -  xp(t) 

Since  In  general  we  don't  know  how  to  calculate  a  priori  the  coefficients 
g^,  we  allow  the  coefficients  to  be  changed  as  data  is  collected  In 

order  to  provide  the  minimum  mean  square  error.  An  algorithm  Is  used  " 
which  updates  the  coefficients  as  each  measurement  sample  Is  collected 
by  using  the  method  of  steepest  descent.  This  Is  given  by 

g^(t  ♦  At)  =  g^t)  +  wE(t)  x(t  -  IAt) 

where 

_  _ o__ 


2 

ox  *  variance  of  x 

a  *  constant  which  determines  rate  of  convergence 

This  type  of  algorithm  was  applied  to  a  number  of  data  sets  with  At 
ranging  from  10  secs  to  15  min.  The  predictions  were  compared  with 
predictions  based  on  persistence.  I.e., 

xp(t)  *  x(t  -  At) 

By  using  a  wide  range  of  a  and  I  values,  the  best  we  could  do  was 
to  decrease  the  mean  square  error  by  0-1 0%  over  persistence.  This  Is 
not  very  encouraging. 


IV.  QUESTIONS 

Our  questions  to  the  panel  are: 

1)  Given  the  nature  of  the  data,  can  we  predict  these  quantities 
significantly  more  accurately  than  by  simply  using  persistence  or  the 
climatological  mean? 

2)  What  are  the  limits  of  predictability?  Can  these  limits  be 
calculated? 

3)  What  prediction  technique  Is  likely  to  be  most  successful  for 
this  problem?  Possible  techniques  that  have  been  discovered  In  the 
literature  are:  a)  Autoregressive  (all-pole) 

b)  Moving  average  (all -zero) 

c)  Mixed  pole-zero  (Box-Jenklns) 

d)  Kalman  type  filter 

V.  PANEL  RECOMMENDATIONS 

The  problem  Is  a  difficult  one  and  may  not  be  amenable  to  solution. 
However,  two  techniques  should  be  attempted.  One  is  the  mixed  autore¬ 
gressive  moving  average  technique  (Box-Jenklns)  for  which  software 
packages  exist.  The  other  Is  the  Kalman  filter  technique. 
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LINEAR  REGRESSION  MODEL  PREDICTION  THEORY 
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and 
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ABSTRACT .  Optimal  prediction,  within  the  normal  theory  framework,  of  one 
vector  variable  by  the  linear  functions  of  another  correlated  random  vector 
variable,  when  certain  values  on  the  predicted  variables  are  missing  is  con¬ 
sidered.  The  optimal  predictors  are  derived  hv  using  both  the  conditional 
expectation  minimization  theory  and  the  canonical  correlation  theory.  However, 
the  maximum  likelihood  estimators  of  the  unknown  parameters  are  derived  only 
for  the  canonical  correlation  theory. 

I.  INTRODUCTION.  This  paper  presents  some  of  the  author's  discussion  (as  one 

« 

of  the  panelists)  on  the  following  two  papers  presented  at  the  twenty-sixth 
United  States  Army  conference,  on  Design  of  Experiments,  held  at  New  Mexico 
State  University,  22-24,  October  1980.  The  first  paper,  "Optimal  Estimation 
Techniques  for  forecasting  Propagation  Parameters,"  was  presented  by  K.  E.  Kunkel 
and  P.  L.  Walters  of  the  United  States  Army,  White  Samis  Missile  Range,  New 
Mexico;  and  the  second  paper,  "A  Stochastic  Mesoscalc  Mctcorologic  Model,"  was 
presented  by  E.  P.  Avnra  of  the  United  States  Army,  White  Sands  Missile  Range, 

New  Mexico.  Both  t he  papers  studied  the  optimal  prediction  theory  of  one 
vector  variable  by  the  linear  functions  of  another  correlated  random  vector 
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variable,  and  the  second  paper  considered . such  theory  when  certain  sample 
values  on  the  predicted  variable  were  (missing)  unobservable.  In  his  dis¬ 
cussion  the  author  pointed  out  some  known  results  (the  derivations  given  here 
are  different),  within  the  normal  theory  framework,  to  the  above  problems. 

The  prediction  of  one  vector  random  variable  by  the  linear  functions  of 
another  correlated  random  vector  variable,  within  the  normal  distribution  theory 
framework,  is  a  very  well  known  problem  in  statistical  literature,  and  an 
exhaustive  paper  on  this  topic  is  published  by  Scobey  and  Kabe  (1980).  The 
two  classical  techniques  often  used  for  this  purpose  are:  1)  the  conditional 
expectation  minimization  theory  (CEMT) ,  and  2'  canonical  correlation  theory. 

In  brief  we  shall  discuss  these  two  techniques.  Both  are  based  on  the  fact 
that  for  two  correlated  vectors  x,y 

E(y-f(x))'(y-f(x))  ,  (1) 

is  minimized  when  c (y  lx)  =  f(x),  and  f(x)  is  the  optimal  predictor  of  y, 
for  a  given  x. 

The  kno'-n  results  of  CEMT  and  canonical  correlation  theory  (CCT)  aTe 
presented  in  the  next  section,  prediction  intervals  are  derived  in  section  3, 
and  missing  values  are  considered  in  section  4. 

Sometimes  the  same  symbol  denotes  different  quantities,  however,  its 
meaning  is  made  explicit  in  the  context. 

II,  SOME  USEFUL  RESULTS.  We  first  present  the  results  for  CEMT.  Let  y  be 
a  q  component  vector  (all  vectors  are  column  vector  and  all  matrices  are  full 
rank  matrices)  with  E(y)  =  0,  F.(yy')  *  E,  E(x)  =  0,  E(xx')  =  A, 

E(yx')  »  B,  B  q*p,  x  p  *  1,  pSq,  and  then  consider  the  minimum  value 
problem 
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Min  {Min  E  I  (/-Ax')  *  (y-Ax)  |  x) } 

A  x 

=  Min  E  {Min  [ (y-E(y |x)) ' ly-E(y| x))  (2) 

A.  x 

♦  ( (Ax-E(y  |x))'(Ax-E(y|x))|x]}  , 

which  by  using  (1)  may  be  written  as 

Min  H  (E(y-Ax) 1 (y-Ax) ! x)  ,  Ax  =  l:.(y!x)  .  (3) 

A 

However,  (3)  reduces  to 

Min  E  (y-Ax) ' (y-Ax)  =  Min  tr  (£-2AB'*AAA' ] 

A  A 

=  Min  tr  (£-RA'V  ♦  (A-RA*1)  A  (A-RA*1)'!  .  (4) 

A 

where  A  is  q  *  p.  From  (-1)  obviously 

A  =  BA'  \  i.e.  ,  AA  *  B  ,  (5) 

is  a  necessary  condition  for  our  minimisation  problem. 

Now  we  have  to  find  that  A.  which  yields  (E-AAA'')  singular.  Wc  now 
consider 


Min  tr  [~-A.‘.A'J  ,  (6) 

A 


to  find  the  minimum  of  (2).  By  our  assumption  A  satisfies 


H  =  | E-AAA ' 


|  F-ua'b  '  | 


(7) 


whence  a  solution  A  to  (7)  is 
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(8) 


AA^  *  TA*(Q'  0)',  AAA*  *  A  t  t|  ♦ 


where 


♦  X  t  t ' 
P  P  P 


Z  m  TAT'  •  (t( . t  )  A  (tl . t  )■ 


Vl*i 


.  *  X  t  t'  , 
q  o  q 


(9) 


and  Xt  >  ...  >  A  are  the  roots  of  T.  and  q  *  <;  T  is  the  matrix  of  the 

latent  vectors  of  £,  n  p  *  p  is  any  arbitrary  orthogonal  matrix.  With  A 
given  by  (8.),  find  from  (6)  that 


Min  tr  (E-AAA' 1  =  X  X 

A  p+1  q 


(10) 


Now  with  A  given  by  (8),  we  find  it  convenient  to  denote  Ax  by  y,  and 
say  that  y  optimally  predicts  y,  for  a  given  x  (actually  y  estimates 
h(vjxj),  i.e., 

-ill 

y  =  Ax  =  BA  *x ,  BA  L  =  AA  2  =  TA  2  (Q1  0)’  .  (11) 

When  A  is  given  by  (8),  equation  (11)  predicts  y  for  a  given  x  by  CF.MT, 
Thus  (8)  implies  that  CLMT  predicts  y  optimally  by  linear  functions  of  x 
by  assuming  all  canonical  correlations  between  y  and  x  are  unity,  i.e., 
from  (8)  the  generalized  variance  of  (x'  y’)’  vanishes.  (TMT  exactly  follows 
CCT  except  that  in  (XT  the  equation  (S)  is  not  satisfied.  Thus  Cl:MT  deals  with 
the  singular  normnl  distribution  theory  and  CCT  deals  with  the  nonsingular 
normal  distribution  theory.  Since  the  results  for  the  singular  normal  distribu¬ 
tion  follow  exactly  on  the  same  lines  as  for  the  nonsingular  case,  wc  consider 
parameter  estimation  for  CCT  on’v, 
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If  the  menus  of  y  and  x  arc  not  zero,  then  (11)  changes  to 


y  =  C(y)  ♦  A(x-E(x) )  =  F.(y)  ♦  BA'^x-Efx)) 


(16) 


Thus  to  predict  y  optimally  by  linear  functions  of  x,  the  population 

parameters  must  be  known .  If  the  population  parameters  are  unknown,  then  they 

are  replaced  by  their  maximum  likelihood  estimators,  when  a  sample  of  size  N 

on  1.x'  )•')'  is  available.  Thus  e.g.,  in  the  usual  nutation 

i  1 


^  S12_S21S11S12^  *  *S22^  ' I  SllS12a22S21Sll 


P  a2 

«  ls„i  it  (i-o.)  , 

i=  1 


(17) 


is  the  sample  counterpart  of  (15),  and 


*J  --  Y  -  S^Sj'jX,  MU'  *  , 


22  2 1  11  1 


(IS) 


is  the  maximum  likelihood  estimator  of 


Vfv-y)  =  V(y-Ax)  =  r  -  BaV 


(19) 


However,  t he  optimal  properties  of  such  sample  counterparts  arc  not  as  yet 
fully  lives L ; c,ited  in  statistical  literature. 

Ill.  PRHPjmON  1NTI.I1V.M.S.  Wo  obtain  prediction  intervals  for  a  single  future 
observation.  These  prediction  intervals  are  based  on  lino's  II  statistic,  see 
k  ibe  vliU.ft)  . 

I.et  a  lp  +  (|)  component  vector  (x'.y')’  luvc  a  (p*‘0  variate  normal 
distribution  with  mean  value  u,  and  covariance  matrix  T.  Then  assuming  ii,l 
to  be  partitioned  correspond  i ng  ly  wc  have 
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(20) 


E(yU)  =  u2  -  E2,EjjU,  •  E21e;',x  •  n  *  B'x  , 

as  the  linear  regression  of  y  on  x.  The  equation  (20)  is  known  as  the 

univariate  random  linear  regression  model.  We  take  (y  -  S^^Sj^x),  and 

B  =  SjjSj2  as  the  maximum  likelihood  estimates  of  n  and  B  respectively, 
where  (x',  y')'  is  the  sample  mean  vector  and  S  is  the  sample  dispersion 
matrix  based  on  a  sample  of  size  N  on  (x',  y')'.  If  (x't  y')'  is  any 

future  observation,  then  from  (16)  the  predictor  of  y  is 

v  =  v  -  S  c"^Y  *  c 

2  111  51  ^  1 '  1 1 X 

=  ^  +  S21S11 (X_X)  ■  (21) 


1  lie  joint  density  of  S  and  v  =  (x’-x’,  y'-y')’  is 

,  4(n-p-ci-:) 

5i\vl  *  K  exp  ( -  4-trE  ( S+hvv '  1 ) !  S ’  * 


where  h  =  N / ( N *  1 ) ,  and  K  as  a  generic  letter  denotes  the  normalizing  con¬ 
stants  of  density  functions  in  this  paper. 

Now  partition  S,  v,  E  *  as. 


r  .  i 

iSll  S12 


r  i 

IV, 


r  1 1  i  -n 

|EU  rK! 


I 

.V2_  , 


21  2"1 
t  1  7  \ 


'■  j  . 


and  by  setting 


"  ■  -  s:iSisi2'  “  *  *2  •  s2isii,r  r  *  s;is 


11  ’ 


write  (22 1  as 
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2(Sn,D,B,vl,u)  *  K  exp  {-^trEjjsu  -  itrE22D 

*  tr£22(B-B)  *  Sn  (B-0)  -  jhvjEj]Vl  -  ih  (u*(B-8) 'v^  • 

E22(u*(B-8) 'Vj)} 

i(N-p-q-2)  i(N-p+q-2) 

:;-i2  isnr  (25) 

Now  integrate  out  B  and  find  the  density  of  u,  v^,  S^,  and  0  to  be 
g CS x x  ,D ,u ,v =  K  exp  {- jtrEj|sn  -  y  trI‘2D 

-Ihvirnvi  -  ihu-i^u/d^hvjs;1^)} 

-iq  ilN-p-q-2)  ifN-p-2) 

( 1+hv  jS  Vj)  ‘  jnr  iSjj!*  (26) 


and  hence  the  prediction  intervals  for  y,  for  a  given  x,  can  be  based  on 
(28). 


IV.  MISSING  OBSERVATIONS.  If  a  sample  of  site  k  (X'  Y')’  is  now  available 
on  (x',y')',  and  a  sample  of  site  (N-K),  N  z  K  >  q  *  1  is  later  available 
on  ( x *  then  the  relations  between  the  maximum  likelihood  estimators  of 
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the  original  sample  and,  the  total  sample  are  available  in  the  literature,  see 
e.g.,  Kabe  (1967).  The  results  given  by  Kabe  (1967)  can  be  easily  modified  to 
suit  the  missing  value  prediction  theory. 

We  now  state  the  problem  proposed  by  E.  P.  Avara  and  its  possible  solution 
outlined  by  the  author.  The  first  sample  is  (X1,  Y ' ) ’  and  the  second  sample 
is  (X^,  01 ' ,  as  no  observations  on  y  were  recorded  in  the  second  sample. 

The  problem  is  how  <joe<  the  entire  theory  of  optimal  prediction  be  carried  on 
under  such  circumstances. 

The  fact  that  the  problem  of  missing  values  forms  a  significant  line  of 
research  is  known  in  the  statistical  literature.  However,  the  extreme  diffi¬ 
culty  involved  in  its  mathematical  treatment  is  the  cause  of  its  not  being 
thoroughly  investigated  as  vet. 

Let  x  denote  the  mean  of  the  entire  sample  and  x  the  mean  of  the  first 

K  observations.  If  S  j  ^  .  S,.,,  and  [  S*,  arc  the  old  ar.d  new 

maximum  likelihood  estimates  of  Ep,  Ep,  Ep  respectively,  then  a  relation 

between  old  and  new  estimates  is  desired.  Obviously  S*  =  (X..X ' -Nx.,x\,)/N  is 

11  N  N  IX  N 

the  maximum  likelihood  estimate  of  Ep.  Further  from  (21') 


c-  I , 
s..:xv-x,) 


2111 


is  the  pred’etor  of  y^,,  and  hence  is  the  maximum  likelihood  estimate  of  C(y). 
The  constant  h  =  K/(N-k)  from  Kabe  (li’67).  To  derive  the  relations  between 
the  old  and  new  maximum  likelihood  estimates  we  first  consider  the  old  data 
representation.  If  1J .  =  [I,  0|,  V  =  (0,  I),  then  the  sample  is  represented 
by 
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(30) 


1 


Y  -  S21s'i*  *  (S22-S21s;,'si2)!  V 


(31) 


However,  the  entire  sample  must  be  represented  by 

1 

2 

Y  a  V  |i* 

N  U 


- 1  - 
y  -  s*  S'  -1  1 

M  2111  \  ♦  is>  .<«  s  , “  v * 


(3?) 


However,  s*iiicc  there  are  no  new  observations  on  (.32)  must  reduce  to 


l 


L 


v  .  s5ls;;'x  .  cs,2-s21su's12)*  v  . 

1 


(33) 


Note  that  V  and  its  coefficient  do  not  change  because  Y  does  not  change. 
It  follows  from  (31)  and  (33)  that 


S21Sjl  =  S2  1S 1 1  ’ 


(34) 


and  hcncc 


S21/N  =  S21SjlSll/N  ’ 


(35) 


is  the  maximum  likelihood  estimate  of  I,  ,  Aim i n  from  (33) 

£.  \ 


s5r’h‘sIr/N  *  >sk-Sjish*,j>'* 


(36) 


is  the  in, i.\ i mum  likelihood  estimator  of  V(y)  = 

This  research  is  supjiortcd  by  a  National  Research  Council  of  Canada 
grant  A-401S. 
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ABSTRACT.  We  have  developed  a  robust  data  smoothing  method  which  was 
motivated  by  the  necessity  of  extracting  a  small  but  nevertheless  Important 
signal  which  is  imbedded  in  a  large  band  of  noise.  It  is  assumed  that 
nothing  is  known  about  the  signal  other  than  that  it  is  of  significantly 
lower  frequency  than  the  noise  in  which  it  is  imbedded.  The  noise  variance 
may  vary  over  a  rather  large  range.  The  signal  to  noise  ratio  may  also 
vary  over  a  large  range,  sometimes  the  signal  will  predominate  but  usually 
the  signal  will  be  almost  invisible  in  a  large  band  of  noise.  We  adapt 
Tukey's  idea  of  using  medians  to  smooth  data  for  exploratory  d»ta 
analysis  to  develop  our  robust  smoother  for  extracting  this  small  signal 
from  noise.  If  Z^,  k=1 ,  2 —  are  the  measured  values  of  signal  plus  noise, 

the  smoothed  values  of  this  time  series  are  given  by 


\  *  median  {Zk-i*  Zk-1+1’  *  Zk+1} 


1  *  0,  N 


where  N  Is  variable  and  Is  made  data  dependent  by  choosing  N  as  a  function 
of  locally  computed  values  of  the  signal  and  noise  statistics.  The 
application  of  the  robust  adaptive  smoothing  technique  is  illustrated 
on  some  WSMR  data  sequences. 

INTRODUCTION.  Median  smoothing  has  been  strongly  advocated  by 
Tukey  (1)  and  [2]  as  a  tool  for  exploratory  data  analysis.  Suppose 
we  have  a  noisy  time  sequence  of  measurements,  x^,  1=1,2, —  ,  which 

we  want  to  smooth.  Median  smoothing  of  this  measurement  sequence 
basically  means  to  compute  a  smoothed  value  at  any  time  t^  by  the  median 

of  the  measurements  about  t^.  More  specifically,  the  smoothed  value 

at  time  tj,  denoted  by  x^,  is  computed  by 

xi  =  med  {xl4.}  (1) 

1  j-O.N  1±J 


!  The  smoother  in  (1)  has  a  smoothing  span  of  2N+1  points. 
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Sever ul  advantages  of  median  smoothing  are  readily  apparent.  Median 
smoothing  is  very  simple  to  implement  since  it  only  requires  the  use  of 
a  subroutine  which  will  order  the  measurements.  Median  smoothing  does 
not  require  the  specification  of  a  model  of  either  the  signal  or  noise, 
i.e. ,  it  is  a  nonpar?  -etrir.  method  as  opposed  to  most  other  smoothing 
methods.  Median  sitH.-ot;-:**,/  i:  robust.  It  tends  to  reject  spurious  values 
or  outliers  in  the  .e as-..  -wents.  An  outlier  or  short  burst  of  outliers  in 
the  measurements  wi.l  appear  in  the  smoothed  output  if  the  length  of 
the  burst  is  smaller  thu..  *'  1  points.  Median  smoothing  has  been  applied 
to  speech  processing,  [3]  and  [4],  and  to  image  processing,  [5]  and  [6]. 
Our  motivation  for  the  development  of  an  adaptive  median  smoothing  routine 
was  for  the  smoothing  of  radar  error  signals. 

SMOOTHING  RADAR  ERROR  SIGNALS.  Let  R  (t-1.  A  (t,.).  and  E  (t.l  be 

■ - - - - -  O  1  O  1  O  1 

the  range,  azimuth,  and  elevation  output  values  of  a  radar  at  time  t^  when 

tracking  a  target.  These  output  readings  specify  a  point  in  space  at 
which  the  radar  is  pointing.  These  values  are  usually  close  to  the  true 
target  range,  azimuth,  and  elevation  values,  which  we  denote  as  R(t^), 

A(tj),  Eftj).  The  target  tracking  errors  are  defined  as 
»'T(t1 )  •  R(ti )  -  R^tj) 


aT(ti)  =  AC^)  -  A0Ct1)  (2) 


eT(ti )  -  E (t 1 )  -  EQ(ti ) 


Measured  values,  r(t^),  a(t^),  and  e(t^)  of  the  tracking  errors  are 

available.  These  measured  values  of  the  tracking  errors  are  called  the 
radar  error  signals.  These  errors  signals  are  usually  very  noisy  compared 
to  their  signal  content.  We  want  to  obtain  smoothed  values, 

r(t^),  a(t^),  and  e(t.)  of  the  radar  error  signals  In  order  to  construct 

Improved  measurements,  Rm ( t ^ ) ,  AnJ ( t i ) ,  A^(t^),  of  the  targets  range, 

azimuth,  and  elevation. 

MV  =  MV  +  r(V 

VV  =  W  +  a(V  0) 

MV  e  MV  +  *<V 
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Median  smoothing  appears  to  be  an  inmediately  applicable  method  which 
can  be  quickly  Implemented  for  the  task  of  constructing  smoothed  values, 

r(t,),  a(t1 ) ,  eUj)  from  the  measured  error  signals,  r(tj),  a^),  e^). 

However,  when  one  attempts  to  directly  apply  median  smoothing  to  the 
error  signals,  some  common  characteristics  of  radar  error  signals 
reduce  the  quality  of  the  smoothed  output  to  such  an  extent  that  the 
attempt  is  unsuccessful.  A  constant  span  median  smoother  cannot  be 
applied  successfully  to  smoothing  radar  error  signals  because  the 
signal  vs.  noise  content  of  the  measurements  varies  over  such  a  wide 
range  during  a  mission.  Fig  1  gives  an  indication  of  this  wide  variation. 
Initially,  when  the  radar  is  acquiring  the  target.  Fig  1  shows  a  rather 
strong  signal  content  as  compared  with  noise.  This  portion  of  the 
measurement  sequence  would  require  a  short  span  smoother  in  order  to 
preserve  the  signal  characteristics.  In  the  later  portion  of  Fig  1 
the  range  of  the  target  from  the  radar  is  increasing  and  the  elevation 
angle  may  be  quite  low  resulting  in  a  very  large  noise  content  relative 
to  any  signal  which  may  be  present.  This  portion  of  the  data  would 
require  a  large  smoothing  span  in  order  to  filter  out  the  large  amount 
of  unwanted  noise.  Thus,  the  characteristics  of  the  radar  error  signals 
force  the  use  of  a  variable  span  median  smoother  where  the  span  at  time 
t.j  must  be  dependent  on  the  relative  content  of  signal  and  noise  at 

times  near  t..  We  call  the  result  an  adaptive  median  smoother. 

ADAPTIVE  MEDIAN  SMOOTHING.  An  adaptive  median  smoother  is  defined 
by 

x.  *  med  lx.  .  }  ,  NMIN<N.<NMAX  (4) 

1  j-0,N1  1±J  ' 

The  choice  of  Is  based  on  the  measured  values,  x j ,  where  t^  is  near 

t.j.  The  maximum,  NMAX,  and  the  minimum,  NMIN,  values  of  can  be 

specified  by  the  general  characteristics  of  the  data  and  thru  experience. 

The  definition  given  in  (4)  obviously  does  not  specify  how  to  obtain 
smoothed  values  near  the  beginnings  and  ends  of  data  sequences.  At  the 
start  and  end  of  data  strings  we  use  the  simplest  possible  smoothing: 

START 

*1  “  X1 
A 

x2  =  med  {x-j ,  x2,  x3>  ^ 

x.  =  med  {x,,.,,}  1 <MMIN 

1  j-0,1-1  1±J 
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END 


xL_j  «  med  (xL.2*  x|_-l*  xlj  (6) 

x,  4  °  med  {x.  4  4)  i<NMlN 

L‘j  j*0,i  L*1±J 


In  (6)  the  subscript  L  denotes  the  last  point. 

METHODS  OF  ADAPTATION.  In  order  to  adapt  the  span  of  the  median 
smoother  to  the  local  characteristics  of  the  data  sequence  at  each  time 
point  t.,  we  examine  the  residuals  in  the  vicinity  to  determine  if  there 
is  some'signal  remaining  in  the  residuals  indicating  that  we  have  been 
oversmoothing  and  should  shorten  the  span  or  whether  the  residuals 
exhibit  a  random  behavior  indicating  that  we  could  possibly  lengthen  the 
smoothing  span.  We  have  used  two  different  measures,  the  serial  correlation, 
and  the  Von  Neumann  ratio  to  examine  the  residuals  for  trends.  In  using 
the  serial  correlation  we  have  tried  both  the  usual  parametric  definition 
and  also  a  nonparametric  correlation  coefficient  which  will  serve  to 
preserve  the  robustness  of  the  overall  method. 

Let  t4  be  the  current  time  at  which  a  smoothed  value  is  being  computed 
and  let  J=l»  NMAX  be  the  residuals  from  the  smoothed  values 

ri-j  =  xi-j  -  *i-j 


Let  ?  *  ave  {r4  ,}  be  the  sample  average  of  the  residuals. 
j*l.N1  1-J 


The  usual 


S  « 


definition  of  the  serial  correlation  coefficient  is  given  by 

V1 

J,  (ri-j+r‘r>(ri-rr) 


r 

l 


(8) 


* 
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If  S, -S- $  wc 

conclude  that 

there  is  no  reason  to  believe  that  the 

L-  -  u 

residuals  are 

serially  correlated.  In 

this  case  we  can  increase  the 

smoothing  interval,  i.e.,  we 

set  NS**- 

— NS+2 ,  subject  to  NS<(2NMAX+1) 

If  S<S.  or  S>5  ,  we  conclude  that  the 

residuals  are  correlated  and  may 

contain  a  significant  signal 

component  because  of  over  smoothing.  In 

this  case  we 

set  NS-* - NS-2 

subject  to  NS> (2NMIN+1 ) .  The  upper  and 

lower  limits 

for  large  are 

-1 

+  1.645«C^Z‘ 

-1  -  1.645^7? 

C  a  _ 

and 

c  1 

Su 

"c< 

\  N.-l 

For  N^<20  we 

use  the  tables 

N 

Su 

SL 

5 

.253 

-.753 

6 

.345 

-.708 

7 

.370 

-.674 

8 

.371 

-.625 

9 

.366 

-.593 

10 

.360 

-.564 

11 

.353 

-.539 

12 

.348 

-.516 

13 

.341 

-.497 

14 

.335 

-.479 

15 

.328 

-.462 

16 

.322 

-.446 

17 

.316 

-.432 

18 

.310 

-.420 

19 

.304 

-.409 

20 

.299 

-.399 

In  order  to  preserve  the  robustness  of  the  median  smoother,  we  should  use 
an  adaptation  test  which  is  itself  robust.  This  is  easily  achieved  in  the 
serial  correlation  case  by  using  a  rank  correlation  coefficient  in  place  of 
the  ordinary  serial  correlation  coefficient  given  in  (8).  Let  R(j )*rank(r^+j ) , 

Vjc{ri*k*ksl’Nr1}  and  let  Ri(j)=rank(ri+j+1),  r1+j+1c{ri+k+1,k-1,Hrl}. 

Then  with  d .=R, ( j )-R(j )  we  compute  the  Spearman  rank  correlation  coefficient 

J  ■ 


as 


lyi 


l  - 


e  i  y 


Ni(N1-l)(Ni-2) 


>42 


(9) 


i  .  .  „  .  i..^ito|ii' i«HtM4kwa*.b U.  vJ.W  ^ - L“1Tlii'tf>  ■-- - 


ff 

i  % 


If  S  <S  >S.  wc  conclude  that  there  is  no  reason  to  believe  that  the 
u-  p-  L 

residuals  are  serially  correlated.  Thus,  we  increase  the  smoothing 
Interval  by  NS^i - NS  +  2,  If  either  S  <S  or  S  <S^  we  decrease 


the  smoothing  Interval  by  NS*« - NS 

the  upper  and  lower  limits  are 


2.  For  large  values  of 


,  1.645 


and  S^ 


-1.645 

vTTpT 


For  values  N^<10  we  use  the  following  table 


5 

1 

-1 

6 

.9 

-.9 

7 

.771 

-.771 

8 

.679 

-.679 

9 

.643 

-.643 

10 

.600 

-.600 

Another  useful  method  for  adjusting  the  smoothing  span  is  the 
Von  Neumann  ratio  which  is  the  ratio  of  the  mean  square  successive 
difference  to  the  variance.  Specifically, 


V 


c 


N1-l 


4^ 

ff. 


i 


l  (r..  -  ?)2 

j=l  1  J 


00) 


Then  If  VL<V<Vu  we  decide  that  there  is  insufficient  evidence  that  the 

residuals  are  correlated  so  that  we  then  increase  the  smoothing  interval 
by  NS  - NS-»2.  If  either  V<V^  or  V>Vuthe  residuals  show  evidence  of 

being  correlated  so  that  we  decrease  the  smoothing  interval  by 

NS-* - NS-2.  The  upper  and  lower  limits  are  sample  size  dependent  and 

are  chosen  by  the  following  table. 
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N1 

$. 

S 

L 

u 

5 

1.0255 

3.9745 

6 

1.0682 

3.7318 

7 

1.0919 

3.5748 

8 

1.1228 

3.4486 

9 

1.1524 

3.3476 

10 

1.1803 

3.2642 

11 

1.2062 

3.1938 

12 

1.2301 

3.1335 

13 

1.2521 

3.0812 

14 

1.2725 

3.0352 

15 

1.2914 

2.9943 

16 

1 . 3090 

2.9577 

17 

1.3253 

2.9247 

18 

1.3405 

2.8948 

19 

1.3547 

2.8675 

20 

1.3680 

2.8425 

EXAMPLE- SMOOTHING  RADAR  ERROR  SIGNALS.  Figs  2-19  present  the  application 
of  adaptive  median  smoothing  to  smoothing  of  range,  azimuth,  and  elevation 
tracking  error  signals  from  a  WSMR  radar.  The  minimum  smoothing  interval 
was  11  points  while  the  maximum  smoothing  interval  was  41  points.  The  method 
used  to  adapt  the  smoothing  interval  to  the  data  was  Spearman  rank  correlation 
coefficient. 

Figs  2-3  present  the  raw  range  error  signal.  Figs  4-5  show  the  smoothed 
range  error  signal  and  Figs  6-7  show  the  range  residuals.  The  range  tracking 
error  does  not  exhibit  significant  signal  content  so  that  the  smoothing  of 
this  signal  is  quite  uninteresting.  Note  that  the  smoothed  range  in  Figs  4-5 
show  bumps  and  dips  having  flat  tops  and  bottoms.  These  flat  peaks  and  valleys 
are  characteristic  of  median  smoothing.  Tukey  suggests  a  method  of  removing 
these  peaks  and  valleys  but  we  have  not  attempted  to  implement  this  in  our 
median  smoothing  routine.  Note  also  that  the  smoothed  range  exhibits  a  very 
constant  behavior.  This  constant,  which  is  not  zero,  may  be  due  to  the 
granularity  of  range  output  readings. 

Figs  8-9  are  the  azimuth  tracking  error  signal.  At  the  beginning  the 
radar  is  acquiring  the  target  so  that  the  error  signal  has  a  very  strong 
signal  content.  After  the  target  has  been  acquired  the  signal  level  decreases 
drastically  and  as  the  target  recedes  from  the  radar  the  noise  content  of  the 
tracking  error  Increases  until  It  is  virtually  one  large  band  of  noise. 

Figs  10-11  are  the  smooth  azimuth  tracking  error  and  Figs  12-13  exhibit  the 
azimuth  tracking  residuals. 


144 


Figs  14-15  are  the  elevation  tracking  error.  Again,  this  tracking 
error  indicates  a  strong  signal  when  the  target  is  being  acquired.  Near 
the  end  of  the  mission  the  noise  amplitude  becomes  very  large.  Also,  near 
the  end  of  the  track  the  elevation  tracking  error  indicates  again  a  strong 
signal  content.  Figs  16-17  present  the  smooth  elevation  tracking  error 
and  Figs  18-19  are  the  elevation  tracking  residuals. 

CONCLUSIONS.  Adaptive  median  smoothing  is  a  very  simple  method  which 
can  be  readily  applied  to  smoothing  almost  any  data  sequence  without 
modeling  either  the  signal  or  noise  characteristics  of  the  sequence.  Adaptive 
median  smoothing  is  also  robust  with  respect  to  outliers.  When  applied  to 
smoothing  radar  tracking  errors  as  in  the  example  given  above,  adaptive 
median  smoothing  does  remarkably  good  In  extracting  the  signal  from  the 
noisy  data  sequence.  Some  additional  features  of  Tukey's  proposals  for 
median  smoothing  remain  to  be  implemented  and  tested  in  our  adaptive  median 
smoothing.  We  plan  to  test  the  use  of  repeated  median  smoothing  and  the 
technique  of  twicing,  i.e.,  resmoothing  the  residuals  in  our  adaptive  median 
smoothing  routine. 
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FITTING  AN  ELLIPSE 


Donald  L.  Buttz 

US  Army  White  Sands  Missile  Range 
White  Sands  Missile  Range,  NM  88002 


ABSTRACT 

Three  program  procedures  to  fit  on  ellipse  to  x,  y  data  constrained 
by  two  criterias  are  described.  The  first  procedure  is  an  iterative 
approach  and  the  remaining  two  procedures  are  of  a  statistical  nature, 
using  a  line  of  regression. 

1.  Introduction 


Beginning  with  an  impact  pattern  plot  as  points  on  an  x,  y  graph,  the 
problem  is  to  evaluate  an  ellipse  of  minimum  area  circumscribing  95*  of 
"activated"  submunition  impacts.  The  first  procedure  hereafter  named  PARAM 
is  an  iterative  procedure.  The  second  procedure  known  as  SELLIPSE  is  a 
statistical  procedure  and  the  third  procedure  labeled  ELLP3  is  also 
statistical  in  nature.  However,  ELLP3  is  more  a  probability  approach 
to  fit  an  ellipse  of  minimum  area  to  a  plotted  impact  pattern. 

The  problem  of  fitting  an  ellipse  to  impact  pattern  arose  as  a  scoring 
criteria.  The  two  criterias  for  that  scoring  are  that  the  area  of  the 
ellipse  must  be  the  smallest  while  secondly  containing  exactly  95X  of  the 
"activated"  submunition  Impacts. 

The  result  of  every  procedure  must  provide  the  area  of  the  ellipse, 
the  length  of  the  major  and  minor  axes  and  the  angle  of  axes  rotation. 

This  paper  shall  describe  the  program  procedures  PARAM,  SELLIPSE  and 
ELLP3  in  that  order  and  then  a  summary  of  comparative  results. 

2.  Program  Procedure  Called  PARAM 

The  notation  and  relationships  expressed  below  apply  to  the  following 
discussion: 


A  Is  the  semimajor  axes 
B  is  the  semi  minor  axes 
C  is  the  ellipse  focal  point 


ELLIPSE  CENTER  AT  ORIGIN 


Standard  equation  of  an  ellipse: 


1. 


A2 


zi 

B2 


ELLIPSE  WITH  CENTER  AT  (h,k)  AND  NEW  AXES  X’  AND  V 

,  2  ,2 

v "  y 

The  equation  of  the  curve  relative  to  these  axes  is  — ^ 


1. 


The  equation  relative  to  the  x  and  y  axes,  by  setting  x'  =  x  -  h  and 
y'  =  y  -  k  becomes,  (x-h)^  +  { y— k ) =  1 

~U~ 


This  is  the  standard  equation  of  the  ellipse  with  center  at  (h,k)  and  major 
axis  is  parallel  to  the  x-axis. 


cos(a) 


*1 

R 


cos (a) 


zl 

R 


let  it  be  noted  that  for  purposes  of  this  procedure  an  activated  bomblet 
will  be  defined  to  lie  within  the  ellipse  if  and  only  if  the  sum  of  its 
distance  from  (C,0)  and  (-C,0)  after  adjustment  for  axis  rotation  is  less 
than  or  equal  to  "2A",  Secondly  each  fitted  ellipse  will  be  centered  at 
the  mean  center  of  impact  in  the  rotated  coordinate  system,  (h,k). 
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l'AH/.K  PROGRAM  FI  OWCHART 


PARAM  IS  AN  ITERATIVE  PROCEDURE  FOR  FITTING  AN  ELLIPSE  OF  MINIMUM  AREA  TO 
PLOTTED  IMPACT  PATTERN  DATA 


N  to 


After  iterating  until  9jhas  been  adjusted  through  179  degrees 
tlie  ellipse  of  minimum  area  is  determined.  The  equation  of  the  ellipse 
of  minimum  area  which  contains  95 »  of  the  activated  submunition  impacts  i 

(x*-h)2  4.  (y 1  — k ) 2  =  1 
A2  B* 


wl.t re  A  and  B  were  saved  from  the  comparisons,  and  the  rotation  of  the 
coordinate  axis  system  is  specified  by  0j  • 

3.  Fj'oj'i  am  Procedure  Called  SELLFF’SE 


SELL1PSE  computes  the  slope  M  and  the  y  intercept  b  from  the  normal 
equations  for  linear  regression. 


The  dashed  figure  indicates  the  possible  ellipse  to  be  determined. 


i.ext ,  SLLL1PSE  transforms  to  center  of  x,  y  distributions.  r  _ 

v*  i-l*i 

1  1  Y  I  xcen  • 


1 

1 

/'M 

- 7 

j. 

** 

ycen 


i-iyi 


coordinates.  Y 

y'1  ■  -x'sina  +  y'cosa 


x'coaa  +  y’sina 


x'  -  x  - 

-  xcen;  y'  > 

to  obtain  the  x"  and 

1* 

/  -  - 

\ 

) 

y  - 


SELLIPSE  computes  the  standard  deviations  of  elliptic  distribution  along 
x"  and  y". 


Cx” 


A 


ycen 
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The 


o 

X 


o 


y 


standard  deviation  in  terms  of  x  and  y 

.  i  /  -Aryl  (NX*2  -  (Xx)2)  +  M2  (NIy2  - 
N  v  1+M 

-  -  v/^~tIM2 (NXx2  -  (Xx)2)  +  (NIy2  - 

N  r  l+M2  * 


respectively  are 

(Xy2)2)  +  2M(NXxy  -  Xxly) ) 


(Xy)2)  -  2M(NXxy  -  XxXy) ) 


The  equation  of 


the  ellipse  in  terms  of  x"  and  y"  is 


1 


where  A  is  the  semimajor  axis  and  B  is  the  seraiminor  axis 
The  ratio  A/o  is  k  and  the  ratio  of  B/o  is  k. 

x  y 

A  *  ko  and  B  -  ko 
x  y 

where  k  is  the  percentile  radius. 

SELLIPSE  does  a  computation  to  circular  coordinates. 


xi  *<2  +  yi«i2  —  p2 

then  R  =  /  x'  ' ' 1  2 +  y'  "2 


where  R  is  the  radius. 

SELL1PSE  computes  the  mean  and  standard  deviation  of  the  radius. 

N 


mean 


baati reaaa 


M-  V  •  >•  -* 


I’ciTcnliJi  Radius  in  defined  as  follows 
R%  -  HMEAN  + 

AX  K 

where  K^^for  a  normal  distribution  equals  1.645  for  95%  ellipse. 

In  a  normal  probability  distribution  function,  f(x)  is  the  probability  density 

where  f(x)  -  (l//?n)exp(-(l/2)x2 J  . 

For  negative  values  of  x,  one  uses  the  fact  that  f(-x)  «  f(x). 

Also,  let  F(x)  be  the  cumulative  distrubtion  function.  Therefore: 

F(x)  -  f  -i-  expl-(l/2)t2jdt 
'  -«•  /2* 

For  F (x  1  i  95%  x  becomes  1.645  our  K  Note  that  R%  =  RMEAN  t  It  o  . 

K  /w>  A  «  A 

The  semi  major  axis  is  A  =  R%°x 
and  the  semi  minor  axis  is  B  ;  R%0y‘ 

The  rotated  angle  is  a  =  TAN~lM). 

The  equation  of  the  ellipse  is  then 

x"2  ♦  y"2  =  1 
A*  B2 
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SELL  IPSE  PKOGRAM  FLOWCHART 


READ  A,Y 


_ i _ 

EVALUATE.  tiOAMAL  eUCATlO^ 
F  OR  LINEAR.  REiiRES^CN  Kt,  B 


IEVAcUATF  STAk'5  (|RD  5E  1/1A  Vif'kl^ 
;  CL  Cl'  r>  Vie  DiS  VRIBU  TtOK) 

*  <rx  A  Mt>  <r* 

I  J 


READ  X,Y 


EVALUATE 
DEViAT-oM  Cf  CIRCULAR 
D  IS  TP  IB  JTiO  M  R,<tr 


i _ 

RA  I'mIS  AMP  i  FA^O  ARC 


[""tTv  ALUAT£  RMEA*  A  AID  o/^TUS. 
I  RA  LlUi 


e  v  alum  e 

SKE  W  WE4v  AMD 
K  URTC51S 


|pRifJT  ^jC,crA/cr 
RME  AM,  K  RVc,  C"R 
R7e  ,  A-  A  X'7/  R  -Aj/Y 
THETA,  XC  c  W,  YC  EM, 
Xml'RT,  YKuRT, 
X5K6  w,  YS<evO 


r 


4*  Program  Procedure  Called  ELLP31 


Assumptiona  and  derivations  will  be  discussed  first. 

The  data  are  presented  as  N  ordered  pairs  (x  ,y  )  representing  locations 
of  subounition  impacts  in  a  north-south/east^weit  rectangular  coordinate 
system,  x  and  y  are  assumed  to  be  jointly  distributed,  normal,  random 
variables  with  respective  means  M  and  M  and  non-zero  variances  estimated 
oy  and  S*  ,  where  x  y 

*  y  .  N 

-2  .  ~  j> 

N  -  1  *  where  1  *  X  ,  y. 

There  may  exist  between  x  and  y  a  non-zreo  correlation  whose  coefficient. 
R,  is  estimated  by 

N  N  N 

,  i2lXiyi  *  iSlXi 

R  *  (N  -  1)S  S  “  • 

x  y 

The  Joint  probability  density  function  is  approximately 


Vx,r) 


rQ<x*y) 


2ttS  S  A  -  R2 
x  y 


where 


X  -  H  y  -  M 


y  -  ", 


a 

"Modern  Probability  and  its  Applications",  by  Emanuel  Parzen,  (John  Wiley 
&  Sons,  New  York).  } 


tn(H2  >  .]>  ;  ;-(T  -R2)l(— 


x  -  M  x  -  M  y-H  y  -  M 

1(-  -  -  2R(~r— 3 -)(— g-*)  +  V) 

**x  x  y  y 


By  translating  axis  so  that  the  new  origin  is  at  the  joint  means  (M  ,  M  ) 
and  simplifying,  the  equation  becomes:  x  y 

-  2(1  -  R2)in(P[Z  >  z])S2S2  i  S2x'2  -  2RSs  *V  +  S2y’2 

x  y  y  x  y  * 

Ey  rotating  the  axis  about  the  new  origin  and  simplifying,  the  equation 
becomes: 


-  2(1  -  R2 ) In (P 1 2  >  z])S2S2  i  x"2(S2cos2£-  2RS  S  <inecc«sb  4  S2sin26)  + 

x  y  y  x  y  x 


y"  (S2sin26  4  2RS  S  sinBcosB  4  S2cos26)  + 
y  x  y  x 

+  x"y  " (-2S2sin6cos0  -  2P.S  S  cos26  4 
y  x  y 

4  2RS  S  sin29  4  2S2sinBcos6) 
x  y  x 

setting  the  coefficient  of  x"y"  equal  to  zero  yields  the  following 
for  6 : 

.  _  2S  S  R 

6  *  ^tan 

x  y 

Simplifying,  to  derive  the  equation  of  the  ellipse 


2(1  -  R2)(-*n(PlZ  >  z] ) )S2S2 

*  V 


S2cos2e  -  2RSvSvsin6cose  4  S2sin20 
y  *  y  A 


2(1  -  R2)<-ln(P[Z  >  z]))S2S2 

x  y 


S^sin^B  4  2RS  S  sinBcosB  +  S^cos^B 
y  x  y  x 
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where 


2(1  -  R2)(-In(PlZ  >  z)))S2sJ 

SyCos^e 


2RSvSusm0c°se  4  S^sin^B 

X  /  * 


B  - 


2(1  -  R2)(-£n(PlZ  >  z)))Sjs2 
*Szsin^8  +  2RS  S  sinBcosB  +  S*cos20 


x  y 


The  ellipse  is  plotled  with  center  <MX »MV )  with  semi  maj°r  axis  A* 
with  semi  minor  axis  B,  and  with  rotation  0  relative  to  the  usual 
north-south/east-west  coordinate  axis  system.  The  constants  R,S  , 

S  ,  Sx2,  and  Sy2  are  all  computed  from  the  coordinates  before  rotation. 

y 

The  following  discussion  applies  to  deriving  of  PPOGRA.M  ELLP3  demon¬ 
strating  the  intermediate  steps. 

Recall  that  we  translated  the  old  axis  to  the  new  origin 
(x,y)  x  -  x'  +  Mx  (x’,y‘) 

OLD  y  -  y*  +  My  NEW 

then  in  the  new  axis  our  origin  is  (0,0)'. 


As  a  result  of  translating  we  Ret: 

-2(1  -  R?)«n(P[Z  >  z])S2S2  i  S2x,Z  -  2RS  S  x’y'  +  sV2 

x  y  y  x  y  '  x 

setting  the  left  hand  side  of  the  equation  to  K,  e  constant,  wc  get: 

K  *  S2x' 2  -  2RS  S  x’y'  +  S2y'2. 
y  x  y  x 

We  do  a  rotation  of  axes  thru  9  with  respect  to  the  old  axes  about 

(M  , M  )  which  is  now  (0,0)' 
x’  y 

(x'.y*)  x'  -  x"co80  -  y''sin0  (x",y") 

TRANSLATED  y'  .  x"ain0  +  y"cos0  ROTATED 


K  •  S2(x"cos9  -  y"sin9)2  -  2RS  S  (x"cos0  -  y"sin0) (x"sin0  +  y"cJo6) 

y  x  y 

+  S2(x"sin9  +  y"co80) 2 

K  -  x"  (S2cosie  -  2RS  S  Sin0cos0  +  S2sir>20)  +  x"y"(-2S2sin0cos0  - 
y  x  y  x  y 

-2RSxSycos20  +  2RSxSysin29  +  2RS2sln9cos0)  + 

+y"2(S2sin29  +  2RSxSyain0coae  +  S2cos26)  . 

Take  the  x"y"  coefficient  and  set  it  equal  to  zero 

-2S28ln6cosO  -  2RS  S  cos26  +  2RS  S  sin20  +  2S2sin0cos9  * 
y  x  y  x  y  x 


0. 


Using  the  identities  sin20  ■  2sin9cos9  and  cos29  ■  cos20  -  sin2©  one  obtains 


(S2  -  S2)sin20  -  2RS  S  cos29  -  0. 

x  y  x  y 

Making  use  of  the  identity  sin26  **  tan26cos20  this  becomes 
cos20[tan29(S2  -  S2)  _  2RS  sj  -  0. 

If  the  product  of  two  numbers  is  zero,  one  or  the  other,or  both  are  zero. 
In  case 


tan26 

then 


2RS  S  /<S2  -  S2) 
x  y  x  y' 


0  -  y  tan'1[2RSxSy/(S2  -  S2) ] . 


We  use  this  value  for20  and  eliminate  the  x"y"  term.  Thus  leaving  terms 

containing  x"2  and  y"  with  coeficients  we  indicate  by  c  „  and  c  where 

x  y 

c  „  “  S2co820  -  2RS  S  sinOcosS  +  S2sin20 

x  y  x  y  x 


c  „  ■  S2sin20  +  2RS  S  sin0cos0  +  S2cos20 

y**  y  x  y  X 

and 

K  -  -2(1  -  R2) £n(P [Z  >  z])S2S2  . 

a  y 

With  the  elimination  of  the  x"  y"  term  we  get  the  equation  of  the  ellipse 
in  the  translated  rotated  axes: 


K 


tl 


J.  —  ■ - —  ~r  -  , 

K/ CjjU  K/Cy” 

We  see  from  this  that  the  semimajor  axis  A  and  the  serairainor  B  have  the  values 


A  “  /k/cx„  ,  B  =■  /k/Cy,, 

when  the  rotated  angle  6  is 

0  -  \  tan'1[2SxSyR/(sJ  -  S2)]  . 
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ELLP3  PROGRAM  FLOWCHART 


ELLP3  IS  A  PROBABILITY  APPROACH  TO  FIT  AN  ELLIPSE  OF  MINIMUM  AREA  TO 
PLOTTED  IMPACT  PATTERN 
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METHODS  FOR  APPROXIMf  xNG  MATHEMATICAL  FUNCTIONS 


t 


Donald  W.  Rankin 

Army  Materiel  Teat  and  Evaluation  Directorate 
US  Army  White  Sand8  Missile  Range 
White  Sands  Missile  Range,  NM  88002 

ABSTRACT .  The  increasing  use  of  small  computers  to  control  complex 
systems  and  equipment  often  gives  rise  to  the  need  for  approximating  mathe- 
matical  functions  under  restrictive  conditions.  These  conditions  may  be  so 
varied  that  no  single  solution  can  be  called  optimum. 

Several  methods  are  discussed  in  some  detail,  with  a  view  toward  simpli¬ 
fication.  In  fact,  some  of  the  procedures  easily  can  be  committed  to  memory. 
An  unexpected  dividend  allows  the  analyst  to  employ  the  powerful  features  of 
some  programming  language  even  though  it  may  lack  a  needed  mathematical 
function. 

A  partial  list  of  subjects  addressed  includes: 

(1)  Power  series.  A  method  for  developing  snother  power  series  which 
converges  more  rapidly. 

(2)  Pads  approximations  (rational  functions).  Developing  a  PadS  expres¬ 
sion  from  a  truncated  power  aeries.  Reducing  the  Pad6  coefficients  to 

integers  (although  the  function  is  unique,  its  coefficients  are  not - a 

powerful  advantage). 

(3)  Operations  which  increase  accuracy. 

(a)  A  linear  combination  of  two  approximations  can  be  much  more 
accurate  than  either  alone. 

(b)  Properly  restricting  the  variable  range  can  markedly  improve 
the  rate  of  convergence. 

(c)  A  Pad6  expression  can  be  "optimized"  for  a  stated  variable 
range,  in  effect  embracing  both  of  the  above  advantages  in  a  single 
expression. 

(4)  Tschebychev  polynomials.  Tschebychev  series.  The  necessity  of 
employing  a  transformation  of  variables.  Choosing  a  transformation  which 
simplifies  the  Tschebychev  expression  and  reduces  the  labor  of  computing  the 
coefficients. 

(5)  Maehly'a  method  of  developing  a  rational  function  from  a  Tschebychev 
series. 

(5)  An  efficient  square  root  algorithm  which  does  not  require  access  to 
assembly  language.  Extension  to  higher  roots. 
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I.  INTRODUCTION.  There  is  increasingly  widespread  use  of  embedded  small 
computers  to  perform  monitor  and  control  functions  in  all  manner  of  complex 
systems.  The  mushrooming  demand  has  triggered  a  virtual  explosion  in  the 
computer  industry.  But  equipments  are  so  different,  and  conditions  can  be  so 
varied  that  no  library  of  software  routines  can  hope  to  avoid  either  paucity 
or  obsolescence. 


This  paper,  then,  is  not  a  catalogue  of  mathematical  approximations. 
Rather,  it  is  a  discussion  of  several  methods  which  can  be  used  to  produce 
required  approximations. 

The  methods  can  be  applied  to  any  mathematical  function  which  can  be 
developed  in  a  power  series.  Analogously,  a  digital  filter  can  be  synthe¬ 
sized  to  approximate  any  physical  process  capable  of  being  expressed  as  a 
linear  difference  equation  of  any  order. 


An  unexpected  dividend  accrued  recently.  An  analysis  program  was  written 
in  COBOL  to  take  advantage  of  the  powerful  "bookkeeping"  features  of  that 
lanquaqe,  even  though  it  contains  no  subroutine  for  computing  required 
logarithms.  Employing  the  methods  herein,  a  suitable  subroutine  was  easily 

devised. 


II.  RATIONAL  APPROXIMATIONS  TO  CERTAIN  MATHEMATICAL  CONSTANTS.  Sometimes 
it  is  useful  to  be  able  to  express  a  mathematical  constant  as  a  ratio.  This 
is  particularly  the  case  when  a  computer  (or  calculator)  will  compute  with 
greater  precision  than  it  will  store  or  accept  inputs.  The  author  own s  a 
calculator  which  computes  to  eleven  digits  but  accepts,  at  most,  eight-digit 
entries. 


A  carefully  chosen  ratio  will  deliver  as  many  significant  digits  as  there 
are  total  digits  in  the  fraction  (reduced,  of  course,  to  lowest  terms). 

There  are  several  methods  for  searching  for  these  approximations.  We 
illustrate  a  method  of  continued  fractions. 


Suppose  we  need  an  approximation  for  *  =  3.141592  653589  793238  46  ••• 
Taking  the  reciprocal  of  the  fractional  part,  we  express  it  as 


n  *  3  +  7.062513  305931  045769  8 


Repeating  the  procedure, 


IT 


3  -  ,  .  _> _ 

'  +  15.996994  406689  7199 


It  is  obvious  that 


will  be  an  excellent  approximation.  Unscrambling  the  continued  fraction 
yields 


x  »  3  ♦  »  in  error  by  only  2.67  *  10"7. 

If  every  numerator  is  unity  and  every  convergent  begins  with  an  integer, 
convergence  is  assured,  but  may  be  slow  (e.g.,  lno10).  A  convenient  way  ta 

hasten  the  process  is  to  begin  with  almost  any  recognizable  approximation, 
then  apply  the  continued  fraction  technique  to  the  residual.  Thus 

lne 10  =  2.302585  092994  045684  018  ••• 

23  _ 1 _ 

"  10  +  386.833279  229539  353860  ••• 

It  is  immediately  apparent  that 

.  in  .  23  1 _ 

lrV  1°  l0  +  386  +  5 

7 


will  be  an  excellent  approximation.  In  vulgar  fraction  form 
lrfe  10  •  *  which  errs  by  3.6  *  ID"10  . 

Similarly,  e  =  2.718281  828459  045235  36  can  be  expressed  as 

+  0.003996  114173  330949  646  ••• 


whence 


from  which 


19  _ 1 _ 

*  '  7  +  250.243100  328250  339641  ••• 


19 


2721 


6  "  "l  +  lOOl  =  Wi  (error  -  l.l  x  io-7). 


One  more  step  will  produce 

19 


e  = 


250  +  4.1 13527  970929  849525 


which  leads  to 


e 


19 

7 


+  250  ♦  — L-r 
4  «■  ? 
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Unscrambling, 


176180 

64813 


(error  **  2.3  x  10'9). 


It  should  be  apparent  that  any  mathematical  constant  can  be  approximated  with 
arbitrary  accuracy  by  an  easily-found  rational  fraction. 

Some  of  the  common  ones  are  listed: 


ir  =  3.141592  653589  793238  46 


e  =  2.718281  828459  045235  36* 


=  ~  -  2.67  x  10-7 


_  312689  n  no  _  iri-11 

11  =  ^9532  ’  2-90  10 


=  -  2.80  x  10-3 


_  49171  ?  77  «  10*70 

e  '  18089  *77  10 


=  9.869604  401089  358618  83 


/e~  =  1.648721  270700  128146  85- 


^  +  3.92  x  IQ'5 


2||||  -  5.52  *  10*' 


/7~  r  +  7.26  x  10-5 

*  u28  x  10''° 


s/180  =  0.017453  292519  943295  77  - 


=  7.389056  098930  650227  23* ••• 


180/n  =  57.295779  513032  320876  80 


1.65  x  10-6 


180/n 


+  4.87  x  10-6 


.  /  829471  c ,,  i n- 1  o 

180/n  =  — ^  ♦  7.52  x  10 


176761 

23922 


5.77  x  IQ'11 


In.  10 


=  2.302585  092994  045684  018* 


/IT  =  1.772453  850905  516027  30  - 

/» r  =  HI  -  1.24  *  10-6 
/7T  =  +  3.16  x  10-9 


In.  10 


In.  10 


~|  -  4.65  x  10-5 


53443 

23210 


+  3.62  x  10-10 


Luler's  constant 


u 


—  =  2.506628  274631  000502  42  • 


Y  =  0.577215  664901  532860  6065.. 


/rT  =  —  -  3.025  x  10’6 


Y  =  355  +  4.75  x  10-7 


221987 

88560 


+  1.492  x  10- 1 1 


33841  ,  ..  m- 11 

Y  =  58628  +  3,15  *  10 
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lna2  =  0.693147  180559  945309  4172* • 


ln#3  =  1.098612-288668  109691  395- •• 


lna2  =  -  3.46  *  10'5 


lna3  =  ^  ♦  2.07  x  10-5 


25669 

36744 


*  6.79  x  10-1 i 


ln,3  s  +  5.98  x  1Q-" 


III.  POWCR  SERIES.  Power  series  are  and  will  remain  most  useful  tools. 
Whenever  great  accuracy  is  required,  a  properly  chosen  power  series  will  deliver 
all  the  precision  of  which  the  computer  is  capable.  Let  us  look  at  a  familiar 
Maclaurln  expansion: * 


-  i  92  ,  9*  9 6  ,  9° 

*  fi  15  ”  17  |9 


This  series  converges  quite  rapidly,  providing  the  value  of  6  is  not  too 
large.  But  we  note  that  if  e  exceeds  we  need  merely  compute  cos  (|-  -  e)* 

Hence  ^  will  be  the  largest  value  of  the  argument  employed.  The  general  term 


can  be  written 


*2n  ■  <-l>"  H^T-7 


Any  term  can  be  computed  from  its  predecessor  by  means  of  a  term-to-term 
recurrence  ratio.  Thus 

*2n  =  2n(2n\  l)  *2n-2  (3) 

There  remains  only  to  compare  the  size  of  the  latest  computed  term  with  some 
pre-established  criterion.  If  the  term  is  small  enough,  the  computation  is 
finished.  Since  it  is  not  known  in  advance  how  many  terms  will  be  required, 
running  time  is  unpredictable,  making  the  method  unsuitable  for  real-time 
situations. 


Another  method  employs  the  "nested"  polynomial  technique.  The  series  is 

a8 

truncated  at  an  arbitrary  point  (e.g.,  after  the  term  and  the  arithmetic 
begun  at  the  other  end.  We  can  write 

««••>'  * 1  ■  ^ 


♦For  the  sake  of  uniformity,  throughout  this  paper  we  shall  employ  series 
wherever  possible  whose  leading  term  is  unity.  The  advantage  when  employing  a 
recurrence  ratio  is  obvious. 


185 


Since  most  computers  multiply  faster  than  they  divide*  it  may  be  more  efficient 
to  multiply  both  sides  by  (9a  362880,  yielding 

(((8*  -  72)82  ♦  3024)82  -  60480)02  +  362880 

-  362880  (4) 

In  this  form,  the  algorithm  will  have  a  fixed  running  time  (which  is  very 
fast),  but  the  error  will  be  a  function  of  the  argument.  The  maximum  error, 
however,  can  be  closely  estimated.  In  this  case  (for  sin  8)  it  is  given  by 

I E 1  <  ~mT  (4)n  =  1,757  *  10'9 

IV.  RESTRICTING  THE  RANGE!  OF  THE  ARGUMENT.  Supposing  that,  in  the  pre- 

0 

v i ou8  example,  we  had  computed  sin  —  ,  then  recovered  the  wanted  value  by  means 
of  the  identity  sin  3  $  =  3  sin  $  -  4  sin5  $  ,  or,  in  handier  form: 

= 3  - 4  sinZ  ♦  w 

The  errors  will  be  as  -9:1,  hence  (for  sin  3  $) 

|e|  <  -j—-  (j^)11  a  0.89  x  10- 15 

This  is  a  dramatic  reduction  in  the  maximum  error.  If  preferred,  the  series 
can  be  truncated  one  more  term,  simplifying  the  nested  polynomial  to 

((42  -  0Z)82  -  840)82  +  5040  -  5040  (6) 


Maximum  error  is  now  |c|  <  (y^-)9  =  1-433  x  10* 10 

We  can  carry  the  process  another  step,  using  the  identity 
sin  5  =  5  sin  #  -  20  sin5  $  +  16  sin5  $  or 

sin  5  _  .  .  2  ..2  . 

3in  ^  =  5  -  4  sin  $  (5  -  4  sin  $) 

The  errors  will  be  as  25:1,  hence 


I  e  1 


(— )7 
v20; 


1.17  x  10"8 


for  the  extremely  simple  expression 


(82  -  20) 92  +  120 


120 


sin  8 
8 


(7) 
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When  faced  with  a  very  slowly  converging  series,  some  such  technique  is 
virtually  mandatory.  The  aeries  for  logarithms  and  for  the  inverse  trigono¬ 
metric  functions  offer  typical  examples.  Let  us  look  at 

arctan  z  .  zi  zi  ii  .  ii  fq\ 

i  =  1  "  3  ♦  Y  “  T  +  9  .  '9) 

for  values  of  z  >  1,  we  employ  the  identity 

arctan  z  =  £  -  arctan  j 

but  this  alone  is  insufficient.  Clearly,  if  we  summed  a  billion  terms,  the 
maximum  error  would  still  be 

I e |  <  (2,000,000 ,001)* 1  s  5  *  IQ'10 

An  additional  step  is  required.  We  offer  two  choices. 

The  first  method  does  not  involve  square  roots.  (Some  computers  perform 
square  root  awkwardly  or  slowly.)  The  identity 

arctan  —  =  arctan  r— ^ —  +  arctan  - - - - r  (10) 

a  1  +  a  l  +  a+  a* 

is  employed.  This  requires  summing  two  series,  but  the  worst-case  argument 
cannot  exceed  y.  For  arctan  z 

|e|  <  29  (2)29  =  °’64  *  10~10 

after  only  fourteen  terms.  If  desired,  the  identity  can  be  employed  a  second 
time,  resulting  in 

arctan  —  =  arctan  t — —  +  arctan  - - - 5-  +  arctan  r - -- - +  (11) 

a  2+a  1  +  a  +  a‘  3  +  3a  +  a* 

Three  series  must  now  be  summed,  but  after  only  nine  terms  each,  the  maximum 
error  is  given  by 


| e I  <  79(3)19  =  0*9  *  10-’° 


In  the  second  method,  the  identity 


arctan  x  =  2  arctan 


1  +  i\  +  x2 
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is  employed.  The  argument  does  not  now  exceed  -  a  0.4142* •«,  and 

1  /2 

after  11  terms,  maximum  error  is  |e|  <  1.36  x  ID*10.  Obviously,  the  iden¬ 
tity  can  be  applied  a  second  time,  yielding 

arctan  x  =  4  arctan  — - -  — - -  (13) 

1  +  u  ♦  /2(u^  *  u) 

where  u2  =  1  ♦  x2.  The  argument  now  i9  no  greater  than  0.1989...,  and  after 
only  six  terms,  |e|  <  2.35  *  10*10. 

The  identity  can  be  applied,  of  course,  as  many  times  as  desired.  One  more 
application  yields,  after  only  four  terms, 

lei  <  7.75  x  lO-’O 

Of  all  the  tools  that  can  be  used  to  increase  the  accuracy  of  an  approxi¬ 
mation,  restricting  the  argument  range  is  perhaps  the  most  important,  and 
should  be  given  the  highest  priority. 

V.  PAPE  APPROXIMATIONS.  Oust  as  the  ratio  of  two  integers  can  be  used  to 
approximate  an  irrational  number,  so  can  the  ratio  of  two  polynomials  be  used 
to  approximate  a  transcendental  function.  If  the  function  can  be  expanded  in 
a  power  series,  there  i9  available  a  part icularly  easy  method  (called  a  Padd 
approximation)  for  obtaining  such  an  approximation.  The  expression  will  look 
like 


l+ex+axZ+ax* 

_ _ Z. _ 2 — - 

l+bx+bx2+bx’ 
1  2  ) 


1  +  C  X  +  C  X2 
1  2 


C  X }  +  c  x**  -*■  ■  •  • 

}  U 


(14) 


It  is  easy  to  see  that  we  can  multiply  the  coefficients  of  the  rational  func¬ 
tion  by  any  arbitrary  constant  without  changing  its  value.  Thus  although  the 
function  may  be  unique,  its  coefficients  never  are.  Herein  lies  a  second 
advantage  of  the  Padd.  In  many  cases  the  coefficients  can  be  reduced  to  small 
integers.  In  this  form,  the  Padd  is  very  economical  of  both  running  time  and 
storage  9pace,  qualities  which  cannot  lightly  be  ignored. 

There  is  no  restriction  on  the  degree  of  polynomial  used  in  either  numera¬ 
tor  or  denominator.  However,  the  better  approximations  occur  when  the  degrees 
of  numerator  and  denominator  do  not  greatly  differ,  and  are  significantly  more 
accurate  than  the  truncated  power  series  from  which  they  have  been  derived. 

The  Padd  is  much  better  behaved  in  the  region  of  maximum  error  (the  slope  of 
the  Padd  error  curve  is  less  steep) . 

A  word  of  caution  is  nece93ary.  All  polynomials  have  zeroes.  All  poly¬ 
nomials  of  odd  degree  have  at  leant  one  real  zero.  Zeroes  in  the  denominator 
usually  produce  poles  in  the  rational  function.  If  such  a  pole  is  fictitious 
(i.e.,  there  is  no  corresponding  pole  in  the  function  being  approximated),  the 
approximation  will  "blow  up"  as  the  argument  approaches  the  pole.  For  real 
values  of  the  argument,  complex  poles  usually  cause  no  difficulty,  nor  do  real 


* 
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poles  which  lie  well  outside  the  argument  range  being  used.  (We  have  found  an 
additional  reason  for  restricting  the  argument  range.) 

It  will  be  found  that  if  a  power  series  is  truncated  after  that  term  whose 
exponent  is  equal  to  the  sum  of  the  degrees  of  the  corresponding  Padd  numera¬ 
tor  and  denominator,  it  provides  just  enough  known  coefficients  to  enable  ua 
to  compute  the  unknown  ones  (in  the  Padd  expression). 

The  basic  form  for  computation  is  written 


1  ♦  a1  x  + 
1  +  b1  x  + 


•♦■ax 
n _ 


+  b  x*1 
m 


1 


2 

+  c  1  x  +  c2x 


■f 


c 

n*m 


X 


(15) 


in  which  the  c  's  are  known,  the  a  's  and  ^'9  unknown.  Both  sides  are 
multiplied  by  tf>e  Padd  denominator,  yielding 


1  +  a^x  + 


a  x 
n 


(1  +  b  x  b  xm)(l  +  c  x  +  ex2  +  *  •  •  ♦  c  xn  +  m)  (16) 

1  m  12  n  +  m 

The  indicated  multiplication  is  performed  on  the  right.  Like  terms  are  col¬ 
lected,  dropping  all  terms  of  degree  greater  than  m  +  n.  These  terms  will  be 
9mall,  but  dropping  them  explains  why  the  Padd  and  power  series  approximations 
yield  different  results. 

The  coefficients  of  terms  of  like  degree  are  now  equated  (after  supplying 
the  left  side  with  m  terms  equal  to  zero) .  This  gives  rise  to  a  system  of 
m  +  n  simultaneous  linear  equations  in  m  +  n  unknowns.  Now  it  i3  seen  why 
the  higher  degree  terms  are  dropped.  Retaining  them  would  lead  to  additional 
equations  which  might  be  (indeed  usually  are)  inconsistent. 

The  required  set  of  simultaneous  linear  equations  can  be  written  directly 
by  means  of  the  following  algorithm: 

Step  1.  Imagining  a  checkerboard  enter  the  known  coefficients  -  the 

coefficients  of  the  given  power  series  -  along  the  principal  diagonal. 

c0  =  l  is  a  known  coefficient. 

Step  2.  Below  cn*ro  draw  a  horizontal  line.  No  terms  will  be  entered 
below  this  line. 
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Step  3.  Enter  the  first  unknown  coefficient  (bx)  everywhere  along  the 
next  diagonal. 

Stop  A.  Repeat  step  3  for  each  successive  coefficient  of  the  Padd 
denominator . 

Step  5.  When  all  the  b/e  have  been  entered,  there  will  remain  at  the 
lower  left  corner  an  empty  n  *  n  triangular  pattern.  Fill  it  up  with 
zeroes. 

Step  6.  Multiply  each  bt  by  the  known  coefficient  at  the  head  of  the 
column. 
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1 

b1 

C1 

cib. 

c2 

b* 

c2b, 

c3 

0 

<=1bm 

c,b, 

0 

0 

b2b« 

ci*4n 

Step  7.  Each  horizontal  row  represents  the  right  side  of  one  of  the 
required  equations.  Add  +  signs  between  appropriate  terms.  Place  an  = 
sign  to  the  left  of  each  expression. 

Step  8.  Arrange  the  coefficients  of  the  Pad4  numerator  (the  a  ' s)  in  a 
vertical  column,  supplying  m  zeroes  at  the  bottom.  Each  element  becomes  the 
left  side  of  the  appropriate  equation.  The  complete  set  of  equations  can  now  be 
written. 


1  =  l 


ai  =  bi  ♦  cl 

a2  =  b2  +  c , b ^  ♦  c2 


b3  +  c,b2  ♦  c2b,  ♦  c, 


[i. 


i 


Step  9.  Those  equation?  whose  left  member  is  zero  form  a  set  of  m  equa¬ 
tions  in  m  unknowns,  and  can  be  solved  for  the  bj's. 

Step  10.  The  a^s  are  found  by  direct  substitution. 

Step  11.  Multiplying  all  the  Padd  coefficients  by  their  l.C.D.  converts 
them  to  integers. 


Let  us  illustrate  by  example.  Suppose  it  is  required  to  write  a  routine 
for  arctan  z.  Let  us  arbitrarily  decide  to  write  a  Padd  of  the  form 


arctan  z 


-  ,U  »  ax2), 

“  1  +  b2*z  ♦  b$x* 


using  the  transformation  x 
ment. 


- - — -  to  restrict  the  range  of  the  argu- 

1  ♦  /l  z2 


Under  this  transformation 

arctan  z  s  2  arctan  x 


so  we  already  know  that  k  =  2. 

Note  that  for  an  even  function,  we  can  mentally  make  the  substitution  w  = 
x2  and  derive  the  Padd  expression  as  though  it  were  a  function  of  w. 

The  appropriate  power  series  is 

assists*  !  .  ...  (9> 

We  require  a  rational  function  such  that 

1  +  ax2 _  arctan  x 

1  +  b2x2  ♦  b4x^  *  x 

Immediately  we  write  the  known  coefficients  as  a  diagonal,  then  add  the 

b'  8. 


1 

b 

2 


0 


b 


i 

* 


7 
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Next  we  multiply  by  the  known  coefficient  at  the  head  of  each  columnt 
1 


b2 

i 

"  5 

b4 

i 

5 

0 

■rb, 

Tb2 

1 

"  7 

After  adding  the  column 

Of  S*  8 

(and  zeroes) 

to  the 

left  and  supplying 

the  needed  =  and  +  signs 

,  we  have 

the  required  set  of 

equations: 

1=1 


„  1  .  i 

0  =  b4  "  7  b2  +  5 

o  I  0  -  y  |  b2  -  ■} 


Those  equations  whose  left  member  is  zero  form  a  set  vrfiich  can  be  solved 
for  the  b's.  Thus,  after  multiplying  the  last  equation  by  3,  we  find 


0 


0 


Adding, 


b 

4 


J 

7 


0  = 

from  which 


15  b2 


_8 

55 


b 

2 


_0  .  J_5  _  6 
55  4  "  7 


0 


or 


b 

4 


_ 5 

55 
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and 


The  maximum  error  occurs  when  x  s  </2  -  1  where,  of  course  z  =  1  and 

arctan  z  =  t 
4 

(J  =  0.78539  81634) 

The  Padd  approximation  yields  (at  z  s  1) 

arctan  z  -  0.78539  52528  427 
in  error  by  -  2.91  x  10”6 

The  truncated  power  series  from  which  the  Padd  was  derived  yields 

arctan  z  «  0.78532  81810  156 
in  error  by  -  7.00  x  10** 

The  Padd  is  noticeably  better,  as  it  often  is  when  the  power  aeries 
converges  slowly  —  another  plus  for  the  Padd.  It  works  best  when  needed 
most . 
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The  algorithm  for  computing  Padd  coefficients  alao  can  be  used  to  develop 
a  reciprocal  power  aeries  whenever  one  is  required.  Since  all  the  a1'a  are 
zero  (except  a-  a  1,  of  course),  the  left  sides  of  all  pertinent  equations 
become  zero,  eliminating  the  need  for  a  simultaneous  set.  The  b,'8  can  be 
determined  in  order,  for  any  arbitrarily  large  number  of  them. 

After  truncation,  both  the  original  power  series  and  the  reciprocal  aeries 
can  be  thought  of  as  limiting-case  Padd  approximations. 

Suppose  that  a  function  is  expressible  in  power  series  form  and  that  the 
Padd  algorithm  haa  been  used  to  develop  the  reciprocal  series.  We  can  write 

f(x>  s  1  ♦  9,x  +  s  1  +  hlX  + 


The  latter  can  be  written 


f  ( x) 


_ 1 

=  1  +  M1  *  k,X  ♦  k2XZ+  •••) 


and  the  process  repeated  upon  the  interior  series,  yielding 


1  +  1  +  Z,x  +  Z2x2  +  ••• 

Continuing  in  this  manner  develops  the  function  in  continued  fraction  form, 
and  emphasizes  the  close  relationship  between  Padd  approximations  and 
continued  fractions. 


If  a  continued  fraction  is  terminated  at  some  nth  convergent,  it  becomes 
an  approximation  to  the  function.  It  is  easy  to  evaluate  by  a  process  of 
"nested  division"  not  unlike  the  evaluation  of  polynomials  by  "nested 
multiplication." 

Use  of  the  continued  fraction  technique  often  results  in  a  program  of  sig¬ 
nificantly  fewer  instructions.  However,  so  many  divisions  must  be  performed 
that  running  time  may  be  quite  slow.  There  are  some  models  of  computers  which 
divide  quite  rapidly  (usually  in  single  precision  only),  for  them,  the  use  of 
continued  fraction  approximations  seems  attractive. 

If  a  terminated  continued  fraction  is  "unscrambled,"  it  will  be  found  that 
the  resulting  rational  function  is  a  member  of  the  set  of  Padd  approximations. 


VI.  DEVELOPING  A  POWER  SERIES  WHICH  CONVERGES  MORE  RAPIDLY.  Let  us  con¬ 
tinue  to  use  as  an  example  the  series  for  arctan  x.  Repeating  for  convenience 


arctan  x 
x 


s  1 


iy2  *  iv4 


1  *  Id 
yx°  +  - 


(9) 
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a  series  which  converges  very  alowly.  Since  this  series  (as  the  right  aide  is 
stated)  is  an  even  function  of  x,  we  mentally  make  the  substitution  w  a  x? 
and  treat  it  as  an  expansion  in  w. 


Now  if  the  ratio  of  two  successive  coefficients  approaches  some  definite 
limit  l  (*  0)  as  the  exponent  of  w  increases  without  bound,  multiplying  by 
(1  -  w/l)n  will  produce  a  new  series  which  converges  more  rapidly  (n  is  a 
positive  integer) .  It  will  be  necessary  to  include  enough  terms  to  accommo¬ 
date  the  multiplier.  Thus,  since  in  the  present  case  l  s  -1, 


1  +  x^ 


arc tan  x  a 


1  ♦  fx2  - 


i'Y 


S'?' 


-  7-9* 


(18) 


and 


(1  t  X2)2 

x 


arctan  x 


.  5  2  8x4  8x6  8x9 

1  +  T*  +  1*3*5  ~  3*5*7  5*7*9  * 


(19) 


1  +  x2 

Using  the  series  for  — - -  arctan  x,  let  us  develop  a  Pedd  expression 

which  has  one  less  term  in  the  denominator  than  did  our  previous  effort.  The 
work  layout  is  so  simple,  we  could  almost  do  it  in  the  head. 


1  =  1 


a  = 


b  « 


2 

3 


r,  Z.  2 

0  *  3b  *  13 


from  which  b  z  ^  and  a  *  The  required  expression  is 


(1  ♦  X?)  ^c-tan  * 


1  ’5  2 

1  +  iT* 

,  1  2 

1  ♦  5X 


Dividing  both  sides  by  1  +  x2, 


arctan  x 
x 


1  —x2 

1  *  15* _ 

1  +  yx2  +  yx4 


for  purposes  of  comparison,  let  us  multiply  by  the  same  integer  as  before. 
Thus 


arctan  x  105  +  91x2 

x  =  105  +  126x2  +  21x4 
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Since  we  used  fewer  terms  of  the  power  series  to  develop  this  expression,  we 
expect  less  sccuracy,  and  indeed  at  z  a  1, 

arctan  z  «  0.78530  37156  499 


in  error  by  -9.445  *  10'* 

VII.  OPTIMIZING  A  PAD&  APPROXIMATION  FOR  A  STATED  VARIABLE  RANGE .  Once 
the  decision  has  been  made  which  sets  the  range  over  which  the  argument  will 
be  allowed  to  vary,  the  Padd  approximation  can  be  "optimized"  for  that 
specific  range.  If  the  range  restrictions  change,  so  must  the  optimizing 
coefficients. 


Let  us  continue  with  the  example,  arctan  z. 

Arranging  the  Padd  coefficients  in  matrix  form,  and  labeling  them  primary 
and  secondary  according  to  the  order  in  which  they  were  developed,  we  find 


and 

-  / 105  91  o\ 

b  \105  126  21/ 

Let  us  produce  what  we  shall  call  the  "delta  matrix"  by  performing  the 
subtraction 


S  -  P  =  4 


If  the  work  has  been  performed  correctly,  the  first  column  will  be  zeroes  and 
the  second  constant.  If  desired,  the  delta  matrix  can  be  reduced  to  lowest 
terms. 


It  turns  out  that  any  multiple  of  4,  when  added  to  P,  produces  a  linear 
combination  of  P  and  S,  which  will  define  another  Padd-like  approximation. 
Its  error  curve  will  be  a  linear  combination  of  two  monotonic  error  curves 
which  are  not  congruent,  and  hence  the  combination  error  curve  will  not  be 
monotonic,  but  will  have  two  zeroes.  One  will  be  at  the  origin.  The  other 
can  be  positioned  arbitrarily.  By  choosing  that  point  near  the  maximum 
allowable  value  of  the  argument,  the  approximation  can  be  markedly  improved. 


To  return  to  the  example,  the  maximum  errors  of  P 

form 


and  S  suggest  the 


P 


s  P’ 


/ 105 

54 

\105 

89 

/3i5 

162 

\31* 

267 
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The  full  expression  is 


arctK, , .  «(»°  t  . 

I  .  3X5  ,  267*1  ,  26x« 


where 


1  +  /l  +  22 

When  2=1,  x  =  / T~  -  1,  arctan  z  =  n/4.  The  approximation  yields  0.78539 
79371  287,  in  error  by  -2.2627  x  10-7.  This  is  sn  order  of  magnitude 
improvement . 

Let  us  define  relative  error  as  follows: 

relative  error  =  ^trul^value0  ~  *  (20) 

Graphing  the  relative  error  of  the  approximation  P',  we  see  that  it 
varies  from  +2.814  x  10-7  at  z  =  0.77  to  -2.881  x  10-7  at  z  =  1.00, 
leaving  virtually  no  room  for  improvement,  for  all  practical  purposes,  then, 
P'  optimizes  P.  (See  Fig.  1) 

Supposing  that  we  apply  the  transformation  a  second  time,  add  a  term  to 
the  numerator,  and  optimize.  How  much  accuracy  would  be  gained? 


n  +  a  x2  ♦  a  x*\ 
arctan  z  -  4x  ^  +  b^f  TbV] 


where  vZ  =  1  +  zz  and  x  = 


1  +  v  + 


1  =  1 


a  =  b 
2  2 


a  =  b 

4  4 


1  h  JL 
3  2  *  5 


0  =  0 


1  b  ♦  i  b 
3  4  5  2 


0  =  0  +0 


+  1  b  -4b 


The  basic  approximation  then  computes  to  be 


,  .  ,  945  +  735x2  +  64x4 

arctan  z  -  4x  +  1050x2  +  225xa) 
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When  1*1,  x  =  0.1989  •••  ,  r.e.  *  1.3  *  10”T° 

The  second  form  is  derived  from 


1*1 


a2  =  b 


a4  =  0 


♦  T  b 


0=0  +0 


from  which 


6  =  | 


!  .  *V  . 

21  10? 

1  ♦  ifc2  ♦ 


yielding 


*  T?b 


2  J 

®2  =  21 


8d  S  105 


reduces  to 


.  /945  1035x2  +  144x4\ 

arctan  z  -  4x^45  +  1J50x/  “  a05xV 


300  80> 

300  180, 


P  -  0.0073734  ia  virtually  optimum,  giving 

.  .  /  945  +  732 ,7B75xz  ♦  63.41x*  \ 

arctan  r  -  4x^45  +  10A7<7875xz  +  223.6725xV 

r.e.  ia  within  ♦.  8.1  x  10-,2 

Multiplying  through  by  400  obviously  recovers  integer  coefficients. 

The  relative  error  curves  for  these  two  "optimized"  approximations  are 
plotted  in  figure  1.  The  form  of  the  error  curve  near  the  maximum  argument  is 
typical . 

The  approximation  P  -  0.00744  is  close  to  optimum  and  yields  smaller 
coefficients,  viz., 

„  .  „  A  / 1573  +  1221 . 3x;  +  105.68xA 
arctan  z  -  4*^575V  1746. 3 x*  ♦  372. 78x*/ 

Multiplying  by  50  recovers  integers. 

Additional  Pad£  approximations  are  given  in  the  appendices. 
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VIII.  THE  USE  or  TSCHEBVCHEV  POLYNOMIALS.  Let  us  introduce  the  following 
shorthand  notation! 

To  *  cos  (0)  =  1 

Ti  s  cos  6 

T2  -  cos  28  s  2  cos2  8-1 

Tj  s  COS  38  a  4  COS5  8-3  cos  8 


Tn  =  cos  n0  (21) 

Recalling  that 

cos  (n8  +  8)  =  cos  8  cos  n8  -  sin  8  sin  n8 


and 


cos  (n9  -  8)  =  cos  8  cos  n8  +  sin  8  sin  n8 
we  obtain  by  simple  addition 

cos  (n8  -  8)  +  cos  (n8  8)  a  2  cos  8  cos  n8  (22) 

which,  stated  in  the  shorthand,  becomes 

Tn-,  +  Tn*1  =  2T! T„  (23) 

This  trigonometric  identity  -  sometimes  called  a  three-term  recurrence 

relation  -  enables  us  to  compute  any  from  the  previous  two.  The 

result  will  be  an  expression  in  the  various  powers  of  cos  6. 

Further  to  simplify,  let  ua  make  the  parametric  substitution  x.=  cos  6. 
This  results  in 

To  =  1 

Tl  =  * 

T2  =  2x2  -  1 
T  3  =  4x  5  -  3x 

T<,  =  8x4  -  8x  2  +  1  (24) 


etc. 
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We  have  just  generated  the  set  of  Tachebychev  polynomials* 

Since  any  Tn  can  be  expressed  aa  the  cosine  of  some  angle ,  it  oan  assume 
only  values  between  the  limits  -1  and  1.  Thus  each  Tn  can  be  said  to  be 
an  equal  ripple  function,  with  known  extrema. 

It  should  be  apparent  that  any  analytic  function  can  be  expressed  aa  a 
series  in  the  Tn’s,  this  being  merely  a  special  case  of  a  Fourier  series. 

Turning  our  attention  to  the  parameter  x  s  coa  9,  it  is  seen  that  it, 
too,  is  subject  to  the  constraint  -  1  <  x  <  1.  For  values  of  the  argument 
outside  these  limits  a  Tachebychev  series  will  not  converge.  Sometimes  this 
difficulty  can  be  overcome  by  a  suitable  transformation  of  variables.  For 
example,  suppose  it  is  desired  to  expand  ln8z  in  a  Tachebychev  series.  The 
argument  is  very  badly  behaved  (0  <  z  <  «),  but  the  transformation 


suggests  itself.  Solving  for  z, 


1  X 

Hence  lne *)  can  be  expanded  in  a  Tachebychev  series  in  the  variable  x. 

Theoretically,  the  number  of  these  transformations  is  endless,  hence  there 
is  no  unique  Tschebychev  series  for  any  mathematical  function.  Thus  when 
stating  a  Tschebychev  expansion,  it  is  necessary  to  state  also  the  transfor¬ 
mation  used. 

It  iB  useful  to  be  able  to  express  the  various  powers  of  x  in  terms  of 
the  Tschebychev  polynomials.  This  Is  easily  done  by  what  I  call  "half"  a 
binomial  expansion.  When  there  is  an  even  number  of  terms  In  the  quasi- 
binomial  expansion,  no  manipulation  la  necessary.  But  when  there  is  an  odd 
number,  the  middle  term  must  be  halved.  This  always  happens  to  the  term 
involving  T0,  so  that  it  is  convenient  to  use  T0/2  and  then  employ  the 
binomial  coefficients.  To  illustrates 


1  : 

:  To 

X  : 

■  T1 

x2 

II 

N) 

+  2-j) 

=  j(T2  +  T0) 

X5 

+  3T , ) 

x4 

=  i<t4 

♦  4T2  + 

^7>  =  J<T*  ♦  4TZ+  3T0) 
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Xs  *  ii»>  *  51,  .  10T,  J 


<"  .  —~a  *  n!  ,  .  alBiU.  I  ,  * 

n- 1  n  n-2  2  n-$ 


(25) 


As  a  check,  the  sum  of  the  interior  coefficients  must  equal  the  exterior 
denominator. 


If  a  given  function  can  be  expressed  as  a  power  series,  and  if  the 
argument  behaves  properly  (i.e.,  its  absolute  value  does  not  exceed  irity), 
direct  substitution  of  the  various  expressions  for  xn,  followed  by  a 
collection  of  terms,  results  in  a  Tschebychev  series,  in  which  the  function  is 
expanded  in  terms  of  the  successive  Tschebychev  polynomials,  rather  than  in 
ascending  powers  of  the  argument. 

Tschebychev  series  have  two  extremely  useful  properties,  which  doubtless 
are  responsible  for  the  considerable  popularity  of  the  method.  They  are: 

1.  A  Tschebychev  series  will  converge  more  rapidly  than  any  other  series 
(given  the  same  function  and  argument).  Thus,  the  desired  accuracy  often  can 
be  attained  with  fewer  terms. 


2.  Each  Tschebychev  polynomial  ia  an  equal  ripple  function  (between  the 
argument  limits  -1  and  +1).  There  will  be  n  +  1  of  these  extrema, 
interlaced  by  n  zeroes.  The  value  of  these  extrema,  of  course,  ia  +  J, 
since  Tn(x)  =  cos  nx.  It  has  been  proven  that,  of  all  polynomials  of  like 
degree  whose  highest-degree  terms  have  the  same  coefficient,  the  Tschebychev 
polynomial  has  the  smallest  extrema.  In  other  words  the  maximum  (extremum)  is 
minimized.  Hence  the  term  "minimax."  Thus,  vrtien  an  approximation  is  computed 
from  a  truncated  Tschebychev  series,  the  error  is  closely  given  by  the 
coefficient  of  the  first  neglected  Tschebychev  term  (see  Fig.  2). 

A  Tschebychev  series  is  essentially  a  Fourier  cosine  series,  and  the 
coefficients  of  course  can  be  computed  by  evaluating  the  pertinent  definite 
integrals.  This  can  be  laborious.  Since  usually  we  will  be  dealing  with 
weil-behaved  analytic  functions,  it  normally  will  be  better  to  expand  the 
function  in  a  power  series  and  then  develop  the  Tschebychev  series  by  direct 
substitution  and  collection  of  terms,  in  the  manner  already  seen.  It  will  be 
observed  that  each  Tschebychev  coefficient  is  itself  the  rum  of  an  infinite 
series.  These  latter  series  are  easily  summed  by  a  programmable  calculator, 
since  they  involve  nothing  more  complicated  than  powers  of  s  constant,  facto¬ 
rials,  and  binomial  coefficients. 

When  the  function  being  approximated  has  a  zero  in  the  range  (-1,  1),  s 
Tschebychev  approximation  may  behave  very  badly  near  this  zero.  Simply 
stated,  the  error  in  the  approximation  may  exceed  the  value  of  the  function. 
Often,  this  is  less  a  fault  than  a  result  of  clumsy  handling.  In  most  cases, 
the  zero  can  be  removed  by  altering  the  function  slightly.  We  shall 
illustrate  by  example. 
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We  wish  to  develop  a  Tsehebychev  series  for  sin  6.  Now 

gi  e7 

llnl.e--*---* 

Letting  the  argument  vary  from  -  ~  to  we  see  that  the  proper  transforma¬ 
tion  is  e  =  £x.  Direct  substitution  yields  a  series  in  those  Tsehebychev 

polynomials  of  odd  subscript.  Truncation,  however,  produces  an  approximation 
which  is  of  little  use  when  sin  9  is  very  small.  The  problem  is  very  easily 
corrected.  Noticing  that  0  factors  the  right  side, 

sin  e  .  e*  o4  e6 

r~ =  1  (U 


The  function 


fact, 


sin  0 


sin  9 


>  0.9. 


contains  no  zero  within  the  chosen  argument  range.  In 


sin  6  x  92  0“  06  . 

-  •  -  (1  •  —  +  —  -  —  4.  *  *  •  ) 

x  4  u  15  l>  17  } 

does  virtually  as  well  and  leads  to  some  simplification.  It  is  obvious  that, 
in  the  last  two  cases,  the  two  Tsehebychev  series  will  involve  only  those 
polynomials  of  even  subscript. 

It  is  instructive  to  show  the  complete  derivation  of  a  Tsehebychev  series 
for  sin  0.  From  the  last  equation,  we  have 


,  ^ 
sin  x 


(i)iii  +  (I) bill  _  +  ... 

'4 '  |J  *  '4  7  V47  |7 


Repeating  for  convenience 


x* 

o 

♦ 

N 

*— 

II 

x- 

■  in,  * *  ”0> 

xb 

s  7?^,  +  6Ta  .  1ST,  * 

10T0) 

x« 

--  m(re *  8T,  *  28,«  ‘ 

56T  2  + 

x»° 

*  T77(,io  *  ‘HU  * 

♦  120T, 

,  ZlOIj  ♦  126T0) 

xl2  3  rois(Ti2  +  12T10  ♦  66T9  +  220T 6  +  495T,  ♦  792T2  +  462T0) 

x14  s  ffTTF ^ T 1 4  +  14T12  ♦  91Tio  +  >64T8  +  1001 T*  ♦  2002T4  +  300JT2  +  1716T0) 
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x16  =  j~j(T16  +  16Tia  +  120T12  ♦  560T10  ♦  1820Te  +  4368T*  +  8008T* 


♦  11440T2  ♦  6435T0) 


(25) 


etc. 

To  shorten  the  notation,  set  p  s  j.  Then 

SLimii  s  p  _  pJii  +  p«s!.  _  p?jd  +  ... 

The  desired  Tschebychev  series  is 

~  a0^0  +  82T2  ♦  8*T*  +  **• 
Substituting  and  collecting  similar  terms, 


(26) 


(27) 


1  ?  1  s  1  7 

ao  =  p  -  tt  p  +  inr  p  -  T§nnr  p  ♦  ••• 


The  general  term  ijiri  is  given  by 


1 1  •  \  n 

*,  .  =  li-ll.. 

2n*’  22n-1 


2n+ 1 

£ _ 

12n  +  l 


C(2r,,  n) 


The  term-to-term  recurrence  ratio  is 


ao 


0)  s 

2(1  +  1 

1 

"  4 

0.785398 

163397 

448309 

62 

-0.040372 

756094 

140390 

85 

0.000933 

897963 

822270 

06 

-0.000011 

430063 

806930 

39 

0.000000 

085684 

836346 

28 

-0.000000 

000432 

447669 

64 

0.000000 

000001 

567473 

35 

-0.000000 

000000 

004275 

39 

0.000000 

000000 

000009 

09 

-0.000000 

000000 

000000 

02 

0.745947  960457  275642  10 


2n  -  1 
2n  ♦  1 


n  T2n-1 


In  a  like  fashion, 

15  Is  1  7  1  9 

82  5  -  7T  P  +  775  P  ~  77T57  P  +  S"294'40'  P  * 


(28) 


(29) 
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The  general  term 


is  given  by 


* 

204-1 


s  i=m 

22n-1 


e2hi1 

1204-1 


C(2n,  n  -  1) 


Note  the  absence  of  the  leadln,  factor  ^  -  That  factor  applies  only  to 
The  term-to-term  recurrence  ratio  ie 


2n>1 


I  2n  -  1  pj —  y 

4  2n  ♦  1  n2  -  1  2o-1 


0* 


-0.040372  756094  140390  85 
0.001245  197285  096360  08 
-0.000017  145095  710395  59 
0.000000  137095  738954  04 
-0.000000  000720  746116  06 
0.000000  000002  687097  16 
-0.000000  000000  007481  94 
0.000000  000000  000016  16 
-0.000000  000000  000000  03 
-0.039144  567527  081957  02 


Continuing 


1  5 

84  *  9?o  P  - 


1 


1 


26880 


1  658880 


- 


The  general  term  *n  is  given  by 


♦ 

2n  + 1 


p2n»  1 

12n  4*  1 


C(2n,  n  -  2) 


The  term-to-term  recurrence  ratio  ia 

1  2n  -  1  P2_  y 

^2ri*l  "  ”  4  2n  +  1  nz  -  4  2r>-1 

0.000311  299321  274090  02 
-0.000006  858038  284158  23 
0.000000  068547  869477  02 
-0.000000  000411  854923  46 
0.000000  000001  679435  73 
-0.000000  000000  004987  96 
0.000000  000000  000011  31 

-0.000000  nnoooo  000000  02 

fly  0.000304  509420  678944  41 
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For  any  coefficient  82i<  (k  :  1,  2,  3,  *••) 

'fcnt-l  s  ^2n-1  12n  +  1  C^2n>  ^  -  k)  (30) 

The  first  term  is  obtained  by  setting  n  s  k.  The  term-to-term  recurrence 
ratio  is 


*  -  _  I  2 Q 1  .  jb!  —  a, 

-  4  *  2n  ♦  1  *  rT  -  k*  ^n-i 

To  complete  the  example,  the  rest  of  the  calculations  follow* 


(31) 


-0.000001 

143006 

380693 

04 

0.000000 

019385 

105564 

86 

-0.000000 

000154 

445596 

30 

0.000000 

oooooo 

746415 

88 

-0.000000 

oooooo 

002493 

98 

0.000000 

oooooo 

000006 

17 

-0.000000 

OOOOOO  OOOOOO  01 

-0.000001 

123574 

976796 

42 

0.000000 

002448 

138195 

61 

-0.000000 

000034 

321243 

62 

0.000000 

oooooo 

223924 

76 

-0.000000 

oooooo 

000906 

90 

0.000000 

oooooo 

000002 

57 

-0.000000 

oooooo 

OOOOOO 

01 

0.000000 

002414 

039972 

41 

-0.000000 

000003 

432124 

36 

0.000000 

oooooo 

040713 

59 

-0.000000 

oooooo 

000226 

73 

0.000000 

oooooo 

OOOOOO 

79 

-0.000000 

000003 

391636 

71 

0.000000 

oooooo 

003392 

80 

-0.000000 

oooooo 

000034 

88 

0.000000 

oooooo 

OOOOOO 

17 

0.000000 

oooooo 

003358 

09 

-0.000000 

oooooo 

000002 

49 

n. oooooo 

oooooo 

OOOOOO  02 

-0.000000 

oooooo 

000002 

47 

al6  "  1.4  x  10-2’ 
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As  a  simple  yet  powerful  check  on  the  calculations,  the  sum  of  the  coeffi¬ 
cients  should  equal  the  value  of  the  function  at  x  2  1.  (When  x  s  1,  every 
Tschebychev  polynomial  also  equals  unity.)  In  the  present  instance 

»ia_a«  t  i/r  ,  u 

When  employing  a  Tschebychev  series  in  the  basic  form,  the  values  of  the 
successive  Tschebychev  polynomials  corresponding  to  the  stated  value  of  the 
argument  are  easily  obtained  by  repeatedly  applying  the  proper  trigonometric 
identity  (three-term  recurrence  relationship). 

When  a  Tschebychev  series  has  been  truncated  for  use,  the  maximum  error  in 
the  approximation  is  given  (nearly)  by  the  first  neglected  coefficient.  The 
equal  ripple  feature  distributes  this  maximum  error  (small  though  it  be) 
throughout  the  argument  range.  It  is  obvious,  then,  that  restricting  the 
range  of  the  argument  will  not  reduce  the  maximum  error.  Again,  this  may  be 
less  a  fault  than  a  faux  pas. 

Consider  what  happens  when  the  principles  of  range  restriction  are  applied 
before  selecting  a  transformation. 

Continuing  to  use  the  sine  function  as  an  example,  suppoae  we  let  0  =  54>- 
Computing  sin  9,  we  recover  the  wanted  function  by  means  of  the  identity 

”4*  2  5-4  ain2  9(5  -  4  sin2  9)  (7) 

Bin  9 

But  9  need  not  exceed  9  degrees.  Hence  we  can  develop  a  Tschebychev 
series  for  the  transformation 


IT 

P  =  20 

All  formulae  remain  the  same.  It  is  only  rycessary  to  substitute  the  new 
value  of  p. 


0.157079  632679  489661  92 
-0.000322  982048  753123  13 
0.000000  298847  348423  13 
-0.000000  000146  304816  73 
0.000000  000000  043870  64 
-0.000000  000000  000008  86 
a0  0.156756  949331  824006  98 

-0.000322  982048  753123  13 
0.000000  398463  131230  84 
-0.000000  000219  457225  09 
0.000000  000000  070193  02 
-0.000000  000000  000014  76 
a2  -0.000322  583805  008939  13 
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0.000000  099615  782807  71 
-0.000000  000087  782890  04 
0.000000  000000  035096  51 
-0.000000  000000  000008  43 
a4  0.000000  099528  035005  75 

-0.000000  000014  630481  67 

o.oooooo  oooooo  010027  57 

-0.000000  000000  0QQQQ3  16 
a6  -0.000000  000014  620457  26 

0.000000  OOOOOO  001253  45 
-0.000000  OOOOOO  OOOOOO  70 
a8  0.000000  OOOOOO  001252  74 

al0  «  -  7.0  x  10-20 

The  resulting  economy  is  obvious. 

A  Tachebychev  series  is  rarely  encountered  in  its  basic  form.  The  compu¬ 
tational  power  and  efficiency  of  a  simple  polynomial  in  nested  form  ia  so 
greet  that  one  cannot  gainsay  its  use.  Hence  most  Tachebychev  series,  after 
truncation,  are  converted  to  this  form  by  substituting  for  the  Tn's  end 
collecting  terms. 

We  illustrate,  using 

\  IT  X 

-  sin  Yo  =  a0T0  +  a2T2  ♦  a4T4 

Thus 

,  _  xx 
a  in  -{I 

- —  s  a0  ♦  a2(2x2  -  1)  ♦  aft(8x4  -  8x2  ♦  1) 

=  (a0  -  a2  +  aft)  ♦  (2a2  -  8aft)  x2  +  8a4x<l 


Simplifying, 


sin  =  x(0. 157079  632665  -  0.000645  963834  x2  +  0.000000  796224  x4) 
Maximum  error  occurs  at  x  =  l  and  is  less  than  1.5  *  10*11,  exactly  as 

1  1TX 

predicted  by  a6 .  The  error  of  this  approximation  to  —  sin  -jg  is  shown  in 
F igure  2 . 

Note  that  when  a  Tachebychev  approximation  has  been  reconverted  to  a  sim¬ 
ple  polynomial  form,  there  is  no  practical  way  to  estimate  the  maximum  error 
by  inspecting  the  resulting  coefficients. 
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IX.  MAEHLY 'S  METHOD.  A  method  attributed  to  Maehly  develops  a  rational 
function  approximation  from  a  Tachebychev  series  in  much  the  same  way  that  a 
PadS  approximation  is  developed  from  a  simple  power  series.  The  basic  form  is 
like 


“o  ♦  o,T,  +  a2T2  ♦  a,T3  (32) 

i  +  elTl  +"82t7  +  s ,f,"  “  +  tlTl  +  ^T2  +  *••• 

Since  the  coefficients  of  a  rational  funotion  are  not  unique,  we  can  arbi¬ 
trarily  choose  any  one  of  them,  and  so  set  8q  =  1. 

As  before,  both  sides  are  multiplied  by  the  Haehiy  denominator  and  like 
terms  collected  on  the  right.  The  multiplication  requires  use  of  the  trigono¬ 
metric  identity 


-  T^n-f*  *  Tn+m)  (m  <  n)  (33) 

It  will  be  noticed  that  this  uses  terms  of  the  basic  Tachebychev  series  out  to 
a  degree  equal  to  the  sum  of  the  degree  of  the  Maehly  numerator  and  twice  the 
degree  of  the  denominator. 

The  computations,  if  systematized,  are  less  complicated  than  at  first 
appears.  If  we  designate  the  product-series  as 

Vo  ♦  **1^1  ♦  ^2*2  ♦  Tj  +  ••••  (34) 


then 


<0  : 

:  y o  +  {(Ml  «■ 

82Y2  +  83Y 

3  +  84Y4  +  • • • 

•  ) 

«1 

=  T 1 

♦  lB,(2Y0  +  Y2 

)  +  {8^!  ■ 

f  Yj)  +  {8 3 ( Y 2 

+ 

y4) 

+  {*4(Y3 

+  Y 

3)  +  ^8s(Y4 

♦  y6)  ♦ 

<2 

=  y2 

♦  T6i(y,  ♦ 

Y  j) 

'  ♦  {®  2 ( 2Y  q  ■ 

*■  Y4)  +  *2  P  3  ( Y  1 

■f 

Ys) 

+  {«4(y2 

+  Y 

4)  +  5 ( Y  3 

+  Y7)  + 

<3 

=  y3 

+  {8,(Y2  + 

y4) 

♦  |82(y  j  ♦ 

Yj)  4*  {@3(2Yq 

♦ 

Y  6> 

+  ^(Y., 

+  Y 

7)  ♦  {85 (y2 

+  Yg  )  ♦  .... 

=  T* 

♦  {my,  + 

♦  y®  2  ^  y  2  ♦ 

Y6)  ♦  je3(Y ! 

«■  y7) 

+  ^4(2Y, 

3  + 

Y0)  +  {®j(y 

\  +  Y  9  )  +  .... 

etc. 
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Within  sach  set  of  parentheses,  the  subscripts  of  the  are  given  by 
the  sum  and  difference  of  the  and  subscripts.  (If  a  negative  value 
occurs,  simply  use  the  absolute  value.)  When  the  subscripts  of  and 
are  equal,  the  coefficient  y0  will  appear  and  must  be  doubled. 

Since  any  reasonable  approximation  will  contain  a  finite  number  of  E^'a, 
each  of  the  expressions  for  will  terminate.  Notice  that  within  each  set 
of  parentheses  the  subscripts  are  either  odd  or  even  in  pairs.  This  means 
that  for  odd  or  even  functions,  half  the  parenthetical  terms  will  vanish. 

It  is  now  possible  to  develop  a  set  of  m  +  n  +  1  simultaneous  linear 
equations  in  the  same  number  of  unknowns  (the  a  *8  and  a).  That  the 
technique  is  adapted  from  the  Pads  method  is  obvious.  The  wanted  equations 
are: 


for  the  example  form  given. 

Exactly  aa  with  the  Pad6,  those  equations  urtiose  left  member  is  zero  are 
solved  for  the  81'a,  after  which  the  c^'s  are  found  by  direct  substitution. 

The  error  curve  of  a  Maehly  strongly  resembles  that  of  a  Tschebychev 
approximation.  In  fact,  it  is  usually  possible  to  select  a  Maehly  and  a 
Tschebychev  so  that  the  error  curves  have  the  same  number  of  ripples, 
similarly  spaced.  Under  these  conditions,  a  linear  combination  of  the  two 
approximations  can  achieve  fantastic  accuracy. 

Let  us  illustrate  the  method,  using  the  Tschebychev  series  for  9-*n  P-x  ; 
p  =  ~.  The  Tschebychev  coefficients,  previously  computed,  are  repeated  for 
convenience: 

Y0  =  0.156756  949331  824007 

y2  =  -0.000322  583805  008939 

Yfl  =  0.000000  099528  035006 

Y,  =  -0.000000  000014  620457 

O 

Ya  s  0.000000  000000  001253 
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We  choose  to  develop  a  Maehly  of  the  form 


.  a  +■  a  T 

sin  P-X-  *  J3 _ l—L  _  u 

x  i  «■  s2t2 


The  required  equations  are: 


O  sC  =  Y  +  Y 
0  0  0  2  *2  2 


«  =  5,  ='  +  58,  (2Y„  +  YA  ) 

2  2  2  2  2  0  4 


0  =  C.  =  Y  +  jB  (y,  +  Y, ) 

4  4  2  2  2  6 

The  last  equation  yields  &2  immediately. 

02  =  0.000617  067744  563840 

oQ  =  0.156756  849803  793512 

a2  =  -0.000225  854117  132272 
Now  and  &2  are  coefficients  of  T2  =  2x2  _ 


Hence 


(ou  •  a, )  +  2a,xz 


(1  -  B2)  +  2B2x2 

The  coefficients  of  a  rational  function  are  never  unique.  This  allows  ua 
arbitrarily  to  set  any  one  of  them.  Let  us  choose  unity  as  the  coefficient  of 
xz  in  the  denominator.  Thus,  dividing  through  by  2@2, 

M  127.200542  6502  -  0.366011  8668  x2 

809.783804  9871  «■  xz 

If  desired,  a  partial  division  will  produce  a  form  more  suitable  for 
subsequent  linear  combination.  Thus 


M  =  0.157079  632695  59 


0.523091  4995  x2 

809.783804  9871  +  xJ 


or,  better  still, 


M  =  -0.366011  866803  88  ♦ 


423.591024  8211 


809.783804  9871  +  x2 


We  wish  to  compare  this  approximation  to  the  Tachebychev  approximation 


T  =  Y0  +  Y2T2  ♦  yj, 
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which,  expressed  in  terms  of  x  ,  is 


sin  dx 


Y2  +  n>  ♦  (2t2  "  af4^2  +  ®Y< 


Y0  -  T2  +  Yi,  *  0.157079  632664  87 

2y2  -  8y<,  =  -0.000645  963834  298 

8ya  s  0. 000000  796224  2800 

An  inspection  of  the  error  plots  (Figure  3)  suggests  the  linear  combina¬ 
tion  0.524  T  +  0.476  M.  Performing  the  multiplication  and  combining  the 
resulting  constant  terms,  we  get 

a-ij-xp-  -  -  0.09  1  911  921082  257 

-  0.000338  485049  17211  x2 

+  0.000000  417221  522744  10  x4 

201.629327  81484 
+  809.783804  98711  +  x* 


an  approximation  so  accurate  that  a  thirteen-digit  calculator  could  detect  no 
error. 

Unfortunately,  the  functions  which  are  encountered  in  real  life  rarely  are 
as  well-behaved  as  is  the  common  sine.  But  even  though  at  first  glance  they 
may  look  solution-proof,  sometimes  a  little  guile  will  go  a  long  way.  For  an 
example,  let  us  return  to  that  recalcitrant  function,  the  inverse  tangent. 
Previously,  we  have  seen  that  if  z  =  tan  0, 


atctan  z 


1  2  1  4  1,6 

yz  +  yz  -72  ♦ 


We  can  restrict  our  interest  to  101  <  whence  |z|  <  1,  but  z  is  not  a 

suitable  Tschebychev  variable.  It  is  easy  to  see  that  the  ratio  of  successive 
coefficients  of  the  Maclaurin  series  tends  toward  the  limit  -1,  making  it 
useless  for  computing  Tschebychev  coefficients.  Moreover,  the  Tschebychev 
aeries  (for  the  transformation  x  =  z)  converges  rather  slowly,  requiring  a 
large  number  of  terms  for  any  stated  accuracy.  Altogether,  a  seemingly 
formidable  taak,  with  a  disappointingly  cumbersome  solution. 

Let  us  employ  some  of  the  tools  we  have  acquired.  First,  let 


1  ♦  /TT 
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Then  arctan  z  a  2  arctan  y. 


But  if  |  zi  <  1,  then  |  yi  < - =  /7~  -  1 

1  +  /T" 


We  make  the  parametric  substitution  y  =  px  and  set  x  s  1  when  y  a  /T~  -1. 

Thus,  p  =  /5~  -  1.  This  is  a  particularly  handy  transformation,  since 
p-i  a  2  +  p  and  -  p*  a  2p  -  1.  The  solution  then,  is 

arctan  ;  a  2  arctan  px 

We  will,  however,  compute  the  Tschebychev  coefficients  for 
arctan  px 


x  s  +  V*  +  V*  * 

in  order  to  avoid  a  zero  within  the  stated  argument  range* 

Now  p  is  a  constant  (— p2  ■  -0.17157  •*•),  and  hence  the  various 
powers  of  p  in 


(36) 


arctan  ox  oJ  p*  d7 

- — a  p  -  ~  x7  +  ~  x4  -  Jiyx6  +  *>*  (37) 

become  part  of  the  coefficients.  Convergence  is  quite  rapid,  and  the 
computations  are  easily  made  on  a  programmable  calculator.  The  resulting 
Tschebychev  series  is: 


arctan 

PX 

X 

=  To  + 

Y2  T2  + 

V*  + 

•  ♦ 

_ z_ 

.{1±  pi 

p  r  - 

1  ♦ 

/T~nr- 

•1 

G 

0.403199 

719161 

511495 

80 

Y2  = 

-0.010749 

968804 

390963 

96 

= 

0.000256 

378716 

684566 

71 

Y6  = 

-0.000007 

264267 

589573 

12 

Tfl  = 

0.000000 

223914 

266710 

62 

II 

o 

-0.000000 

007256 

851307 

14 

Y1  2  S 

0.000000 

000243 

155037 

30 

Yu  2 

-0.000000 

000008 

343268 

50 

= 

0.000000 

000000 

291421 

29 

CD 

II 

-0.000000 

000000 

010320 

90 

(36) 
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Y20  s  0.000000  000000  000369  59 
Y22  =  -0.000000  000000  000013  36 


4.86  x  10-19 


Y,,  »  -1.78  x  10-20 

c  6 


From  these  coefficients,  let  us  compute  the  simplest  possible  Maehly: 


1  -  v2 


Thus, 


*0  =  ?0  8  Y0  +  2B2Y2 


0  =  C2  =  Y2  +  1*2  (2Y0  +  V 


8  =  - - i — 

2  1 
y  +  —y. 
0  *  4 


82  =  0.026653  173701  372558  84 
aQ  =  0.403056  458768  597611  48 


1  +  82(2x2  -  1)  =  (1  -  B2)  +  2B2x2 


Dividing  through  by  282, 

arctan  px  7.561134  431579 

x  =  18.259492  044066  +  x2 

The  relative  error  in  this  approximation  does  not  exceed  2.9  x  10-4.  Figure  4 
compares  this  error  with  that  of  the  simple  Tschebychev  approximation 
T  =  yQ  +  y2T2.  Also  shown  is  the  error  curve  for  the  linear  combination 

1.78981  H  -  0.78981  T. 


This  error  curve  (for  the  linear  combination)  is  shown  on  an  expanded  scale  in 
the  lower  graph,  where  it  clearly  reveals  the  shape  of  the  SECOND  missing 
Tschebychev  polynomial.  It  can  be  shown  that  the  linear  combination  method 
produces  an  approximation  virtually  as  accurate  a3  the  Tschebychev  (or  Maehly) 
of  next  higher  order. 
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X.  ASYMPTOTIC  SERIES  -  A  LOOK  AHEAD.  The  author  presently  Is  investi¬ 

gating  the  theory  of  asymptotic  series,  first  results  are  very  promising  and 
may  warrant  e  future  paper.  Some  tentative  findings  erei 

1.  To  be  useful,  an  asymptotic  series  mjst  be  an  alternating  series. 

2.  Provided  the  value  of  the  argument  is  not  too  small,  there  will  be  a 
smallest  term  which  is  not  the  first  term.  "Smallest"  is  taken  in  the  sense 
of  the  absolute  value.  There  may  be  two  consecutive  smallest  terms  (equal  in 
absolute  value  but  opposite  in  sign,  of  course). 

3.  After  the  smallest  term,  the  series  diverges.  None  of  these  divergent 
terms  should  be  included  in  any  approximation. 

4.  Truncation  immediately  after  the  smallest  term  produces  an  approxima¬ 
tion  which  errs  less  than  that  produced  by  truncotion  at  any  other  point. 

There  is  a  widespread  — -  but  mistaken  -  belief  that  this  represents  the 

best  aporoximation  of  which  the  asymptotic  series  is  capable.  It  is  simply 
not  true,  as  we  shell  see  immediately  when  we  pursue  the  New  Look. 

5.  If,  in  place  of  the  first  divergent  term,  there  is  substituted  a  term 
of  like  sign  whose  absolute  value  is  exactly  half  the  absolute  value  of  the 
smallest  term,  there  is  a  definite  reduction  in  the  error  of  the  approxima¬ 
tion,  often  by  more  than  an  order  of  magnitude. 

6.  If  the  remaining  error  is  plotted  as  a  function  of  the  argument,  it  is 
seen  to  be  a  "sawtooth."  See  [9]  for  an  example.  This  function  appears  to 
possess  continuous  derivatives.  If  so,  it  is  a  sufficient  condition  for  the 
existence  of  an  exact  analytical  expression  (e.g.,  a  Fourier  or  Tschebychev 
series) . 

7.  Discovery  of  such  an  expression  is  the  next  logical  step.  (Only  an 
approximation  wa3  developed  in  [9].) 

8.  The  composite  expression  should  converge  for  all  values  of  the  argu¬ 
ment  down  to  include  the  point  where  the  sum  of  the  first  two  terms  of  the 
asymptotic  series  is  zero,  thereby  reducing  the  total  expression  at  that  point 

to  something  like  f(z)  -  \  *  e(z).  For  two  commonly  used  asymptotic  series, 

this  minimum  value  of  the  argument  may  turn  out  to  bei 

for  lner(z),  z  =  -  0.289  ([2]  and  {8]). 

for  erfc  z,  z  s  y/f  -  0.707  ([9]). 
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APPENDIX  A 


AN  ALGORITHM  FOR  SQUARE  ROOT 

Before  actually  developing  an  approximation,  let  us  address  a  few  ancil¬ 
lary  matters. 

First.  The  Newton-Raohson  technique.  If  t1  ia  any  reasonable  estimate 
of  /z  ,  then 


Vi  *  t  h  *  /i 


is  a  better  one. 


The  process  can  be  repeated  endlessly  and  is  found  to  converge  quadrat ically 
upon  the  true  value.  Examining  the  form  of  the  iterative  equation  suggests 
that  the  first  estimate  be  in  the  form  of  a  rational  function  approximation, 
since  if  r,  :  N/q,  then 

I  rN  zDi  N2  ♦  zD2 

r2  =2  Id  +-p  “  2DN  *  (38) 


thereby  saving  a  division.  In  fact 


1  f  N2  ♦  zD2 


2zDN 
N2  ♦  zDi 


•It 


:n2  +  zD2)2  AzP2N2 
2DN(N2  +  zD2) 


N*  +  6zD2N2  +  z2D6 
4DN  (N2  *  zO2) 


and  still  another  division  ia  saved. 

Second.  The  bilinear  transformation.  Simply  stated, 

wz  +  aw  ♦  bz  +  c  s  0 

w  and  z  ere  variables,  a,  b  and  c  are  coefficients,  any  of  which  might 
be  complex.  Should  the  term  in  wz  appear  with  a  coefficient  other  than 
unity,  division  by  that  coefficient  will  produce  the  above  form  without  loss 
of  generality.  The  expression  can  be  regarded  as  linear  in  w  or  as  linear 
in  z,  but  is  NOT  linear  in  both  together?  hence,  the  term  "bilinear."  It  is 
easy  to  solve  for  either  variable  in  terms  of  the  other,  viz., 

wz  ♦  aw  =  -  bz  -  c 
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Two  (or  more)  successive  bllinesr  transformations  can  always  be  replaced  by  a 
SINGLE  bilinear  transformation  (with,  of  course,  different  coefficients).  Let 
us  examine  the  bilinear  tranformation 


w 


z  +  1 


Solving  for  z, 


z 


1  +  w 
1  -  w 


(42) 


As  z  varies  through  the  argument'  range  0  to  •,  w  varies  from  -1  to  +1, 
suggesting  that  a  transformation  of  this  or  a  similar  form  (b  -  -1,  c  a  a) 
will  be  useful  for  square  roots,  logarithms,  or  any  function  whose  argument 
must  be  non-negative. 

Third.  Catering  to  very  large  argument  ranges.  Superficially,  it  would 
seem  that  the  bilinear  transformation  would  be  enough.  But  we  find  severe 
warping  near  the  band  edges  (aa  w  ♦  ±1  in  the  above  example).  This 
introduces  on  acute  scaling  problem  similar  to  that  of  the  tangent  function 
near  90  degrees.  Clearly,  an  additional  device  is  needed. 

A  possible  approach  is  to  develop  several  approximations,  each  for  use 
with  a  different  stated  argument  range.  Since  this  method  would  seem  to 
require  more  perseverance  than  ingenuity,  it  will  not  be  further  pursued  in 
this  paper. 

Instead,  we  shall  define  a  process  and  call  it  "normalization."  Basi¬ 
cally,  it  amounts  to  a  transformation  which  separates  a  floating  point  number 
into  exponent  and  mantissa.  The  floating  point  number  can  be  expressed  to  any 
convenient  base.  The  approximation  is  applied  to  the  mantissa,  after  which  a 
suitable  inverse  transformation  recovers  the  desired  result.  For  the  square 
root,  this  amounts  to  dividing  by  any  arbitrarily  chosen  perfect  square,  com¬ 
puting  the  approximation,  then  multiplying  by  the  perfect  root. 


/z  = 


A  suggested  algorithm  follows. 

Consider  separately  0  <  z  <  1  and  1  <  z  <  .. 
Oenote  the  latest  transform  by  #. 
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Case  2.  0  <  z  <  1. 


Either  apply  the  "normalization"  algorithm  to  the  reciprocal,  ,  or 

replace  all  the  constants  within  the  algorithm  by  their  reciprocals  (and  > 
by  <). 

The  resulting  normalized  argument  will  always  lie  in  the  range  0.2  <  z*  <  5 
This  suggests  the  bilinear  transformation 

z*  -  1 

px 

Setting  x  s  1  when  z*  s  5,  we  find  that  p  =  <|.  If  we  can  find  a  power 

series  for  the  square  root  function,  it  should  be  easy  to  compute  the 
Tschebychev  coefficients  for  this  transformation,  and  hence  the  Maehly 
rational  function. 


1  ♦  ox 
1  -  px 


■  »  P*H1  -  px) 
(1  -  px) 2 


(1  -  px)  s  (1  -  p2x2)1/2  (45) 

and  the  right  side  can  be  expanded  by  means  of  the  binomial  theorem,  yielding 

^1  -  P2*2  =  1  -  j  P2*2  -  ~  p*x4  -  p6x6  -  p8x8 

- Z_  p10x10 _ 11 L  n12X12 _ LL  n  1 4X 1 4  -  ... 

2*6  P  1024  P  2048  P 

The  general  term  (after  the  first)  is 

-  |2n-2  p2nx2n 

_____ -  ( 47  ) 

2  2n-1  |n  |n- 1 

and  the  recurrence  ratio  is  2 - *p2x2,  Inserting  the  value  p  =  1,  the 

2  |2n- 2  X2n  4n  -  6 

general  term  becomes  -  -ttz -  and  the  recurrence  ratio  —  ■sf —  X2. 

|n  |n- 1  “n 

The  series  for  computing  the  coefficients  of  the  Tschebychev  series  require 
the  additional  multipliers  (for  the  general  term)  of 


2  2n-1  |n.  a  |n  + > 


where  Y  is  the  Tschebychev  coefficient  being  computed  and  ^  ie  a  term 

2  •  2n 
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. -  -  .  r 

°LtfVerie8  U8ed  *°  comPute  it;-  Temporarily  skipping  over  the  problem  of 
computing  y0,  we  find  that  the  first  non-zero  term  in  the  aeries  for  v, 

a  !  3»  ***'  occ«r8  when  n  a  a.  The  expression  for  it  simplifies  ?o 

-4  12n-  2 
6*  """in  ln-1 

The  recurrence  ratio  is  ^0  -T  ■  ^?n.~  X), 

9{n-'  -  az) 

the  first  SS£2£^  5*™  of  the  8erie8  for  Yn  is  identical  with 

ush  t#r?  1  r?T  Y?*  The  recurrence  ratio  is  the  same.  Only  the 

value  of  8  is  different.  Wherefore  we  compute  this  term,  reset  a  to  zero 
and  sum  the  resulting  series  —  yielding,  of  course,  Yfl  -  ' 

Thus  we  have  the  Tschebychev  series 


(1  -  px)  Jz*  = 


Y4T4  + 


where 


T*  -  1 
Z*  ♦  1 


pH 


y8  = 

y  i  a  s 
Yl2  = 


0.877328 
-0.126982 
-0.004619 
-0.00033 6 
-0.000030 
-0.000003 


215244  754 6 
320508  2891 
211325  3075 
513800  0460 
660625  1123 
129636  9144 


=  -0.000000  342323  7095 


Yu  = 
Yu  = 


=  -0.000000 


-0.000000 


=  -0.000000 


Y2o  s 
Y22  = 
Yz4  8 
Y26  = 
Y28  8 

Yin  8 


-0.000000 
-0.000000 
-0 .000000 
-0.000000 
-0 .000000 
-0 .000000 
-0 .000000 


039230  6243 
004649  4341 
000565  1841 
000070  0800 
000008  8292 
000001  1270 
000000  1454 
000000  0189 
000000  0025 
000000  0003 
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It  seems  possible  that  a  Maehly  aa  simple  as 

(1  -  px)  /z*  * 


a.  ♦  a„T„  u 
0  2  2  _  N 


1  +  BjTj  w 

may  be  sufficient  for  our  needs.  Before  computing  the  Maehly  coefficients, 
however,  let  us  take  notice  of  how  the  bilinear  transformation  we  have  used 
simplifies  the  application  of  the  Newton-Raphson  technique.  Of  course 


r.  = 


N 


i,  7  0(1  -  px) 


Then 


2  <-o(l  -  px)  + 


B*1 


px) 


P(1  -  px) 
N 


] 


*2  “  20(1  -  px)  + 


0(1  +  ox) 
2N 


which  combines  to 


In  similar  fashion. 


_  N2  +  D*(l  -  d2x2) 
C2  "  20N(1  -  px) 


N2  »  P?(l  P2*2).  _ DN,(l,_t-P.xI 

AfSfcl/l  _ \  T  i  _ 


4DN(1  -  px) 


M2  +  D*(l  -  p2x2) 


(49) 


which  states  the  final  solution  in  terms  of  the  original  Maehly  approximation. 
The  Maehly  coefficients  arei 


*0 


=  0.881935  217627 


o2  =  -0.190474  825654 
02  s  -0.072561  319783 

Since  the  coefficients  of  a  rational  function  are  not  tnique,  neither  are  N 
and  0  —  only  their  ratio  is.  Thus  there  are  a  limitless  number  of  forms 
in  which  they  can  be  expressed.  Perhaps  the  simplest  is 


<}  -  x)  /z* 


11.08452  -  3.93753  x* 
7.39072  -  x2 
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In  final  form 
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o  =  C6  s  Y6  ♦  762(^4  +  YB)  ♦  ♦  Y,o) 


0  :  ;8  =  Yg  ♦  {MY*  +  Y10)  ♦  784(^4  ♦  Y1Z) 


Computing, 


a0  *  0.891040  789794 


Oj  a  -0.316214  857627 
aA  a  0.011427  219988 
P2  a  -0.216071  134840 
64  a  0.002611  917418 


Simplification  results  in 
(\  -  x)  Sz*  * 


17. 4847865  -  51.9623632x2  +  6.5625467x4 
58.3231998  -  21.6812755xJ  V  x4 


D(2  +  X)  1. 


20(f  -  x) 


At  x  a  1  (z*  a  5),  this  approximation  errs  by  10*13.  To  obtain  that  accuracy, 
the  coefficients  must  be  stated  to  five  or  six  decimal  places  (seven  are 
shown) . 


APPENDIX  B 


CUBE  ROOT 


Many  of  the  problems  involved  in  computing  the  higher  roots  are  similar  to 
those  of  the  square  root  and  submit  to  similar  solutions.  Additionally,  the 
odd-numbered  roots  admit  negative  values  of  the  argument.  So  saying,  the 
matters  of  "normalization"  and  accounting  for  sign  are  left  to  the  reader. 

To  develop  a  power  aeries  for  cube  root,  we  employ  the  transformation 

z*  -  1 


Solving  for  z*,  z*  -  j' 


U-r  £*)2 
[1  -  px)J 


so  that 


2/3 

(1  -  px)  S  (1  -  px) 

1  2  1  7  1  _ 4  1  i 

=  1  -  -jP*  -  ,pzxz  -  eTP  * 

_ Ln4x4 _ . 

243^*  729P  X 


(50) 


Let  us  develop  a  Pad6  approximation  of  the  form 


(1  -  px)  K* 


1  ♦  atpx  ♦  a?p2x2  ♦  a}pJx3 
1  +  t^px  ♦  b2p2x2 


(51) 


With  a  Padd,  it  is  a  matter  of  indifference  whether  we  restrict  the  argument 
range  before  or  after  developing  the  rational  function.  Therefore,  in  the 
interest  of  convenience,  set  p  s  1.  The  simultaneous  system  of  equations  then 
is 


a2 
8  3 


0 

0 


0 

0 

0 


2h  v 

“  3b2  ■  9bl  “  81 

“  0  ‘  9b2  ‘  81b1  ~  243 

n  n  4.  7  .  Ill 

-  0  -  0  -  fl1b2  -  24 3b 1  “  729 


Solving, 


h 


8  2  _ft 

n1  =  "  5  B2  s  1  ®3  =  "  81 

Dividing  both  aides  of  the  expression  by  1  -  x  and  multiplying  all 
coefficients  by  their  L.C.D.  yields 


405  -  648 x  +  270x2  -  20x3 
“  405  -  783x  +  441xf  -  63x3 


It  is  desirable  to  choose  an  argument  range  (for  the  "normalized"  variable) 
the  ratio  of  whose  end  points  is  a  perfect  cube  (e.g.,  8).  Noticing  that  x 
passes  through  zero  as  z*  passes  through  unity,  we  can  see  at  once  that 

<  z|  <  1  and  1  <  z*  <  8  will  be  a  good  choice  for  these  end  points.  It 

turns  out  that  in  the  range  0.35  <  z*  <  2.8,  the  relative  error  is  less  than 
5  x  10**.  However,  we  can  do  nearly  an  order  of  magnitude  better  than  that. 


The  two  term  divisor  enables  us  to  use  the  original  series  as  a  Padd  (of 
which  it  is  a  special  case)  and  write,  after  multiplying  by  405t 
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Subtracting  and  reducing  to  lowest  terms,  we  find 


The  Padd  is  "optimized"  at  P  -  0.84  from  which 

.  »05  -  652. 8x  +  274x2  -  20x3 

z  405  -  787. 8x  ♦  446. 6xs  -  63 .8x3 


Within  the  argument  range  0.38296  <  z#  <  3.06853  the  relative  error  does  not 
exceed  7.167  x  10* *.  The  ratio  of  these  end  points  slightly  exceeds  8  —  a 
perfect  cube  —  so  that  the  upper'  limit  of  z*  (for  which  the  "normalization" 
routine  searches)  arbitrarily  can  be  set  anywhere  between  3.0637  and  3.0685 
- —  say  3.066. 

For  any  odd-numbered  root,  there  is  a  specialized  adaptation  of  the 
Newton-Raphson  technique  which  converges  more  rapidly  than  any  other.  Before 
developing  it,  however,  let  us  review  Newton-Raphson  in  simple  form: 

Let  f (y)  s  z  -  y"  (n  s  3,  5,  7,  9,  ...) 

f ' (y)  =  -ny0*1 


Suppose  we  have  an  estimate  of  the  root,  y1  .  We  also  know  that  at  the 
true  root,  y,  f(y)  =  0.  This  enables  us  to  write  the  approximation 


f(Vi)  -  0 
Vi  -  * 


(52) 


Now  y  ie  the  only  unknown,  but  this  is  only  an  approximation,  so  we  will  not 
recover  y,  but  y2 ,  another  (and  hopefully  better)  estimate.  Substituting 
and  rearranging, 


f<yt> 

y2  S  yi  *  f'(yi) 

z  -  y.n 


y2  s  n  t(n  -  v  y^  +  t^=t]  <53> 

Examining  the  second  derivative,  it  is  found  that 

f "  (y)  a  -  n  (n  -  1)  y"*2.  (54) 

The  curvature  is  not  negligible,  and  increases  with  higher  roots  - 

introducing  error  into  the  approximation  snd  thereby  slowing  the  rate  of 
convergence. 
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We  could  just  ea  well  have  written  the  original  expression  in  the  form 

y^  f (y)  a  2  -  yn  (n  =  3,  5,  ?,  9,  •••)  (55) 

The  left  aide  is  still  a  function  of  y  which  drives  to  zero  at  the  desired 

n-  1 

root.  But  y  ^  s  0  only  for  the  trivial  case  y  =  z  a  0.  Thus  we  can  apply 
the  Newton-Raphson  technique  to 

n+1 

f(y>  *  -^r  ‘  (56> 


But  now  look  at  the  derivatives!! 


<"< v)  •  -  -rb  - 


2  ,  2  .  n-  i 

«■■■(»>  «  (M-1)  -sb-  y-1" 


It  is  seen  that  there  ia  a  point  of  inflection  exactly  at  the  desired  root. 
This  means  that  aa  the  estimate  approaches  the  true  value,  the  slope  becomes 
virtually  constant,  thereby  hastening  convergence.  The  rate  of  convergence  is 
never  worse  then  quadratic,  and  ultimately  tends  toward  order  of  magnitude  n. 

Perhaps  this  process  is  beat  expressed  in  digital  filter  form;  i.e.,  as  an 
output/input  ratio.  Thus 


y2  (n  +  l)z  +  (n  -  l)y1n 
y1  "  (n  -  l)z  +  (n  +  Dy," 


For  the  cube  root,  this  reduces  to 


2z  + 


y,  "  z  +  2yt* 


To  cite  an  example,  we  find  the  following  errors  in  the  approximation  to 

to  : 

after  Padd  ("optimized"),  -0.000092 

after  modified  Newton-Raphson,  -0.28  *  10* 1Z 
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APPENDIX  C 


LOGARITHMS  TO  ANY  BASE 

The  argument  is  "normalized”  to  a  value  in  the  range  3.165  >  z*  > 
(3.165) -1.  The  ratio  of  these  two  limits  slightly  exceeds  10.  The  method  is 
strikingly  similar  tu  that  used  for  square  root.  The  parameter  h  is  set  to 
zero. 

If  z*  *  10*", 

then  h*  *  h  +  n.  (61) 

After  the  approximation  ha9  been  applied, 

logb  z  =  logb  z*  ♦  h*logb  10.  (62) 

The  most  commonly  used  bases  are  2,  e,  10,  and  16.  The  necessary  constants 
are  found  in  the  first  part  of  this  paper. 

We  develop  s  Maehly  rational  function  for  lnez*  using  the  transformation 

z*  -  1 

px  =  z*"+"T  »  P  s  0,52 

Thus  z*  =  p-*  ,  and 

2~lnez*  sp+£3x2+£^x4+£7x6-»-  **’  (63) 

The  coefficients  of  the  Tschebychev  series  are  now  computed.  They  arej 

Y g  s  0.546850  950695  9441 
Y2  =  0.028096  097358  0741 
Y 4  =  0.001314  425494  3168 
Y6  =  0.000073  490993  0960 
Y8  =  0.000004  482077  7161 
Y10  s  0.000000  207824  8525 
Y12  =  0.000000  019125  8782 
Y16  =  0.00000L  001302  1921 
Yu  =  0.000000  000090  2873 
Yie  s  0.000000  000006  3490 
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Y20  «  0.000000  000000  4515 

y22  #  0.000000  000000  0324 

Y2ft  s  0.000000  000000  0023 

Y26  =  0.000000  000000  0002 

A  Maehly  rational  function  of  the  form 


is  now  developed. 


2V1".  *' 


a  ♦  a  T  +  a  T  +  4  T 

-JL.  :.JLJL  -JLA-  JLA 

1  ♦  82^  +  &4T4  +  SfiTg 


a0  * 

0.543339 

498483 

1167 

a2  = 

-0.108861 

411680 

5337 

II 

0.002641 

510414 

0964 

“6  = 

-0.000008 

962142 

2536 

02  s  -0.250375  253205  3509 
3ft  s  0.008877  426771  5024 
06  =  -0.000076  902174  0061 

At  x  =  1,  (2*  =  19/6),  the  relative  error  is  -4.957  x  10'12. 


APPENDIX  D 

RECOVERING  AN  ANGLE  FROM  RECTANGULAR  CO-ORDINATES 


Let  us  label  the  sides  of 
a  right  triangle  X,  Y  and  R. 


Given  any  two,  it  is  required  to 
find  the  angle  0. 

We  have  immediately  that  R2  s  X2  ♦  Y2.  (64) 
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Host  programmers  unhesitatingly  (and  unthinkingly!)  select  one  of  the  common 
ratios 


Y 

X 


Y  X 

tan  0,  y  =  sin  6,  —  =  co9 


6 


and  compute  the  inverse.  There  are  serious  objections  to  this  cavalier 
approach: 


(1)  Near  6s  90° ,  tan  6  presents  scaling  difficulties  as  the  slope 
increases  without  bound. 


(2)  Also  near  6  =  90°,  sin  6  becomes  a  most  imprecise  measure  of 
angle,  since  the  slope  approaches  zero,  rendering  the  function  insensitive  to 
changes  in  the  argument. 

(3)  Near  6  :  0,  cos  6  exhibits  the  same  disadvantages,  plus  the 
additional  one  of  failing  to  change  sign  as  6  passes  through  zero. 

The  answer  usually  is  taught  during  the  first  week  of  most  college  trigono¬ 
metry  courses,  then  promptly  forgotten: 


6  Y 
2  =  X  +  R 


(65) 


0  7T 

The  function  tan  j  behaves  very  well  indeed.  Provided  -  y  <  6  <  j  , 

e 

the  absolute  value  of  tan  y  varies  between  0  and  1,  while  ils  slope 

varies  between  1  and  2.  This  fundamental  identity  appears  in  many  forms, 
two  of  which  are 


tan  S.  -  _ tan_S _ 

*  - 1  ♦  .0. ,  - ,  ,  .T-rnsr-j 

Applying  the  identity  a  second  time  yields  (since 
Y2  +  (X  +  R)2  a  2R  (X  +  R))i 


tan  £  =  - Y  —  (66) 

(X  ♦  R)  +  /2R(X  +  R) 

For  simplicity  in  notation,  let  Q  =  /zk(x  +  k) .  Then 

e  Y 

tan  *  =  -  ■  —  a  t 

(X  +  R  a  Q)  +  /2(2R  +  Q)(X  +  R) 


or 


t  a  tan  “I  a  - ~  - 

(X  +  R  ♦  Q)  +  /2Q  ( X  +  R  +  Q) 


(67) 
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The  Pad6  approximation  now  is  used  to  recover  the  angle  8  .  It  ia 


9=8  srctan  t  =  8t 


/l89  +  147t2+  I2.8t4\ 
^  189  +  210 t 2  ♦  45t«  / 


The  error  in  0  ia  leas  than  2.2  x  10- '0  radians. 

Combining  factors  and  employii  n  r- *  -d  polynomial  form  yields 


_  t  ,(.,-/....t2  +  1176)t2  +  15121 
radians  "  4"  t2  +  210)t2  ♦  189  ' 


or 


.  _  t_  f ( 1228 . 8 1 2  ±  14112) t2  +  18144! 

oegreas  ~  it  l  (3t2  +  14)t2  +  12.6  ' 

It  is  to  be  remarked  that  in  order  to  achieve  this  accuracy,  a  Maclaurin 
series  would  have  to  employ  the  t1 J  term,  while  a  Tschebychev  aeries 
requires  six  coefficients  expressed  to  10-digit  accuracy. 
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MOS  TRAINING  COURSE  SELECTION 
CRITERIA:  AN  APPLICATION  OF  DISCRIMINANT  ANALYSIS 

Pat  Cassidy  and  Lounall  Snodgrass 
Analysis  Branch  II 

Training  Efface iveneas  Analysis  Division 
US  Army  TRADOC  Syatcas  Analysis  Activity 
White  Sands  Missile  Range,  New  Mexico 

I.  INTRODUCTION 

This  Is  a  study  of  criteria  by  which  soldiers  are  selected  for  Military 
Occupational  Speciality  (MOS)  training  schools.  Three  distinct  MOS's  and 
their  associated  training  courses  are  considered.  For  simplicity,  they  will 
be  referred  to  as  MOS  A,  MOS  B,  and  MOS  C. 

Intelligence  screening  of  new  recruits  Is  accomplished  with  the  Armed 
Forces  Qualification  Test  (AFQT).  Job  or  occupation  qualifications  are 
determined  with  the  Armed  Forces  Vocational  Aptitude  Battery  (ASVAB) .  These 
tests  are  described  In  Tables  1  and  2.  In  the  development  of  ASVAB,  training 
course  performance  was  taken  as  the  measire  of  soldier  performance.  Aptitude 
composites  were  developed  to  maximize  validity  coefficients.  Consequently, 
the  composites  are  composed  of  several  tests  and  are  highly  intercorrelated.^ 
For  a  description  of  the  development  of  the  aptitude  composites  see  Fuchs  and 
Maler  (1973,  78).  Composite  scores  normally  range  from  40  to  160,  with  an 
average  score  near  100  and  standard  deviation  near  20. 

Typically  the  selection  criteria  for  a  specific  training  school  (course) 
will  consist  of  a  minimum  score  on  a  single  ASVAB  composite.  Unlike  raw  test 
scores,  aptitude  composites  are  maintained  In  a  soldier's  personnel  file  where 
they  can  be  easily  obtained  by  a  particular  school.  Rarely,  minimum  scores  on 
two  composites  may  be  required.  As  weapon  systems,  the  Army  population,  and 
training  courses  have  evolved;  some  schools  have  experienced  high  attrition 
rates.  TRADOC  Systems  Analysis  Activity  (TRASANA)  was  asked  to  study  samples 
from  three  MOS  school  (courses)  and  recommend  improved  selection  criteria. 


TABLE  1 


TESTS  IN  THE  ARMED  SERVICES  VOCATIONAL  APTITUDE  BATTERY  (ASVAB) 


General  Ability 

Tests 

Arlthnetlc  Reasoning 

General  Information 
Mathematics  Knowledge 
Science  Knowledge 

AR 

GI 

MK 

SK 

Word  Knowledge 

WK 

Mechanical  Ability 

Automotive  Information 

AJ 

Tests 

Electronics  Information 

El 

Mechanical  Comprehension 

MC 

Tirade  Information 

TI 

Perceptual  Ability 

Attention  to  Detail 

AD 

Tests 

Pattern  Analysis 

PA 

Classification 

Attentiveness  Scale 

CA 

Inventory 

Combat  Scale 

CC 

Electronics  Scale 

CE 

Maintenance  Scale 

CM 
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TABLE  2 


Aptitude  Area 

Symbol  Title 

Composite 

ACB  Tests 

CO 

Combat 

AR+TI+PA+AD+CC 

FA 

Field  Artillery 

AR«GI«MK+EI4CA 

EL 

Electronics  Re¬ 
pair 

AR+EI*MC+TI4CE 

OF 

Operators  and 

Food 

GI+AIKA 

sc 

Surveillance  and 
Communications 

AR4WK4MC-*PA 

MM 

Mechanical  Main¬ 
tenance 

MK+AI+EI*TI<M 

GM 

General  Mainte¬ 
nance 

AR+SK+AUMC 

CL 

Clerical 

AR+WK+AD+CA 

ST 

Skilled  Technical 

AR«MK+$K 

GT 

General 

AR^K 
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ARMY  JOBS 


Major  Ralated  Jobs 


infantry,  Armor,  Combat  Engineer 


Field  Cannon  and  Rocket  Artillery 


Missiles  Repair,  Air  Defense 
Repair,  Tactical  Electronics 
Repair,  Fixed  Plant  Communication 
Repair 


Missiles  Crewman,  Air  Oefense 
Crewnan,  Driver,  Food  Services 


Target  Acquisition  and  Combat 
Surveillance,  Communication 
Operations 


Mechanical  A  Air  Maintenance,  Ralls 


Construction  and  Utilities, 
Chemical,  Marine,  Petroleum 


Administrative,  Finance  Supply 


Medical,  Military  Police,  Intel  11 
gence,  Data  Processing,  Air 
Control,  Topography  and  (Vlntlng 
Information,  and  Audio  Visual 


Used  only  to  qualify  for  special 
tests,  as  Officer  Candidate  Test 


Of  course,  high  attrition  rates  might  be  remedied  by  Improving  the  courses. 
This  remedy  has  been  done;  however,  this  was  not  part  of  the  TRASANA  study. 

II.  DATA 

One  sample  was  provided  for  such  MOS.  The  current  selection  criteria, 
the  number  passing  and  falling,  course  scores,  ASVAB  composite  and  AFQT  scores 
were  available  for  each  MOS  sample.  The  type  of  failure  -academic, 
non-academic  was  known  for  MOS  C.  These  data  are  sunmarlzed  In  Table  3. 


TABLE  3 


"Horn 

“  CURRENT - 

SELECTION 

CRITERIA 

— mm — 

PASSING 

COURSE 

- NUMBER - 

FAILURES 

- NORBCR - 

NON-ACADEMIC 

FAILURES 

A 

EL  >  90 
a  7  90 

114 

69 

N/A 

8 

EL  >  90 

227 

78 

N/A 

c 

EL  l  120 

109 

73 
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III.  ANAL  VS  IS 

Stepwise  discriminant  analysis  was  the  technique  chosen  to  determine 
Improved  selection  criteria.  This  method  produces  a  linear  combination  of 
ASVAB  composites  which  best  discriminates  between  the  pass  and  fall  groups. 
This  linear  discriminant  faction  allows  the  Incorporation  of  posteriori 
probabilities  and  the  costs  of  mlsclasslflcatlon.  The  resulting 
classification  procedure  minimizes  the  expected  cost  of  mlsclasslflcatlon 
under  certain  conditions.  For  a  description  of  discriminant  analysis  see  A.A. 
Aflfl  and  S.  P.  Azen  or  T.  W.  Anderson. 
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A.  MOS  A  ANALYSIS 

This  analysis  produced  MM  and  ST  as  the  variables  which  best 
discriminate  between  the  two  groups.  For  simplicity  of  application,  selection 
criteria  are  traditionally  given  as  minimum  scores  on  one  or  two  (rarely) 
composites*  Consequent! y ,  the  linear  discriminant  function  Is  not  a 
practical  classification  tool.  To  determine  a  more  practical  classification 
procedure,  the  114  sample  cases  were  ranked  first  by  HI,  then  by  ST.  The 
following  classification  procedure  was  then  determined:  If  MM  £  100  and 
ST  >_  100  classify  as  pass;  otherwise  classify  a*>  fall.  A  graphical  comparison 
of  the  two  procedures  Is  given  In  Figure  I.  The  attrition  rate  using  the 
proposed  MM/ST  criteria  would  be  15.5%,  while  reducing  the  number  of  soldiers 
chosen  for  the  course  by  56  (see  TABLE  4). 

An  alternate  criterion,  proposed  by  Army  School  A,  using  EL  ^  105 
was  also  considered.  Course  attrition  and  the  course  attendees  available  for 
this  sample  are  shown  In  TABLE  4. 

TABLE  4 

RELATIVE  EFFECTIVENESS  OF  THREE  ALTERNATIVE  COURSE 
SELECTION  CRITERIA  FOR  MOS  A 


SELECTED 


ACTUAL: 

tL  a  CL  l  90 

114 

69 

45 

39.5 

ALTERNATIVES: 

MM  >  100, 

ST  l  100 

58 

49 

9 

15.5 

EL  >  105 

56 

42 

14 

25.0 

239 


n J  "r'""  —  i  tmmmBUSSfBSitS  l!.1"  IM" 


B.  MOS  B  ANALYSIS 

This  analysis  mbs  different  from  the  HOS  A  analysis  since  all 
criteria  considered  Included  the  original  EL  criteria.  The  discriminant 
analysis  chose  SC  as  the  best  predictor  of  passing  or  falling  the  NOS  B 
course.  Chi-square  tests  Mere  performed  to  test  for  the  Independence  of  a 
selection  criterion  from  the  pass  or  fall  classification.  Table  5  summarizes 
the  proposed  criteria,  their  attrition  rates,  and  the  number  of  soldiers 
selected  from  the  sample. 


TABLE  5 

RELATIVE  EFFECTIVENESS  OF  ALTERNATIVE  COURSE  SELECTION 
CRITERIA  FOR  MOS  B 


SELECTION 

ATTENDEES  " 

Graduates 

NON-ehAbuAtEs 

ATTRITION 

CRITERIA 

SELECTED 

RATE  (%) 

ACTUAL: 

EL>?0 

305 

227 

78 

25.6 

ALTERNATIVES: 

EL>95 

235 

184 

51 

21.7 

EL>1 00 

157 

132 

25 

15.9 

EL  >  90 

132 

116 

16 

12.1 

SC  7  100 

EL  >  90 

182 

154 

28 

15.4 

SC  7  95 

J 
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C.  MOS  C  ANALYSIS: 

This  analysis  also  considered  the  original  criteria  (EL  £  120)  as  a 
necessary  condition  for  any  new  criteria  since  the  sample  used  for  the 
discriminant  analysis  wes  chosen  by  this  criteria.  The  discriminant  analysis 
chose  GM  as  the  score  that  best  discriminates  between  pass  or  fall  groups 
(academic  failures).  The  non-academic  failures  were  not  Included  In  the 
analysis.  Chi-square  tests  were  performed  to  test  for  the  independence  of  a 
selection  criteria  from  the  pass  or  fall  classification.  Table  6  summarizes 
the  proposed  criteria,  their  attrition  rates,  and  the  number  of  soldiers 
selected  from  the  sample. 


TABLE  6 

RELATIVE  EFFECTIVENESS  OF  ALTERNATIVE  COURSE  SELECTION  CRITERIA  FOR  MOS  C 


selEctIoN 

CRITERIA' 

'ATTENDEES  ' 
SELECTED 

aromTEr" 

NDN-fiEABnATES' 

AIIRTIIUN 
RATE  (%1 

ACTUAL: 

ED  120 

182 

109 

73 

40.1 

ALTERNATIVES: 

EL>1 25 

103 

69 

34 

33.0 

EDI  30 

56 

46 

10 

17.9 

ED120  A 
GM5T25 

78 

60 

18 

23.1 

ED  120  A 
GH£l20 

102 

75 

27 

26.5 
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IV.  SUMMARY 

The  proposed  criteria  that  best  discriminates  between  graduating  or 
non-graduating  from  the  MOS  A  course  were  an  MM>;  100  and  ST  £  100.  The  best 
selection  criteria  fbr  MOS  8  were  EL  >,  90  and  SC  £  95.  Finally  the  most 
promising  selection  criteria  for  MOS  C  were  EL  £  120  and  GM  _>  120.  These 
conclusions  are  based  on  the  alternate  criteria  that  lower  course  attrition 
while  do  not  substantially  reduce  the  attendees  selected. 

V.  POINTS  OF  DISCUSSION: 

A.  The  data  for  the  MOS  A  and  MOS  B  analysis  contained  no  distinction 
between  academic  and  non-academic  fallires.  Since  ASVA6  composites  are 
Intended  to  Indicate  subject  aptitudes,  their  use  to  predict  non-academic 
failures  might  be  questioned.  Depending  on  the  sample  their  Inclusion  or 
exclusion  could  significantly  alter  the  conclusions. 

B.  All  samples  In  the  study  >*re  selected  from  current  courses. 
Therefore,  all  cases  met  the  current  selection  criteria  for  each  sample.  For 
MOS  B  and  MOS  C  the  current  criteria  were  Included  as  part  of  the  new 
criteria.  For  MOS  A  the  current  criteria  was  omitted.  The  justification  for 
this  is  that  current  criterion  Is  not  very  restrictive.  The  statistical 
justification  for  such  a  generalization  Is  lacking. 

C.  In  the  MOS  A  analysis  a  ranking  procedure  was  used  to  develop 
absolute  cut-off  scores  fbr  a  classification  procedtre.  The  relation  of  such 
a  procedure  to  that  of  the  discriminant  faction  and  Its  "optimal**  properties. 
If  any,  were  ignored. 
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THE  ARMOR  COMBAT  FOR  MODEL  SUPPORT  (ARCOMS)  FIELD  EXPERIMENT 


Roger  F.  Willis 

US  Army  TRADOC  Systems  Analysis  Activity 
White  Sands  Missile  Range,  New  Mexico  88002 


ABSTRACT. 

The  Armor  Combat  Operations  Model  Support  (ARCOMS)  Test,  Phase  II,  is  a 
force-on- force  field  experiment  aimed  at  collecting  target  acquisition  and 
engagement  data  for  use  in  the  design  and  running  of  combined  arms  simula¬ 
tions  and  war  games.  TRASANA  is  the  proponent  and  the  test  will  be  conducted 
by  TCATA  at  Fort  Hood  in  January-February  1981.  Tactical  realism  would  be 
unacceptably  sacrificed  If  certain  key  variables  were  controlled.  Due  to 
resource  constraints,  very  few  replications  can  be  run  under  given  conditions. 
This  presentation  will  pose  the  question  (specifically  for  ARCOMS)  of  how  to 
extract  the  maximum  amount  of  valid  Information  from  relatively  uncontrolled 
field  experiments  (operational  tests)  carried  out  with  very  small  sample  sizes. 

1.  Background. 

a.  ARCOMS  will  be  the  first  in  a  series  of  field  experiments  carried  out 
to  provide  better  input  data  for  combined  arms  models  and  war  games. 

b.  It  has  been  recognized  for  years  by  the  modeling  community  that  we  do 
not  have  adequate  data  on  attacker  detection  rates,  In  realistic  force-on- 
force  conditions,  and  on  attacker  engagement  dynamics  and  fire  distribution. 

The  ARCOMS  test  also  presents  the  opportunity  to  gain  valuable  insights  con¬ 
cerning  alternative  attacker  tactics  and  defender  detection  rates  and  defender 
fire  distribution. 

c.  For  the  first  time  Intervlslbility  between  combat  vehicles  will  be 
measured  dynamically  and  recorded  automatically. 

2.  Purpose. 

The  purpose  of  ARCOMS  Is  to  examine  the  combat  processes  In  a  force-on- force 
environment  and  to  provide  Input  data  for  TRADOC  combined  arms  models,  simu¬ 
lations  and  games.  Emphasis  Is  to  be  placed  on  Identifying  the  process  by 
which  the  attacker  acquires  and  uses  information  during  the  attack.  Data  on 
detection  probabilities  will  be  keyed  to  the  times  at  which  intervlslbility 
starts.  TRASANA  will  use  the  data  to  develop  algorithms  and  to  provide  Input 
parameters  for  the  revision  of  combat  models.  In  addition  to  serving  as  an 
empirical  source  for  probability  distributions  and  ether  data  for  models, 
the  ARCOMS  test  outputs  will  be  used  for  testing  a  number  of  hypotheses  about 
the  basic  nature  of  combat  processes. 
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3.  Scope.  ARCOMS  will  consist  of  a  series  of  force-on-force  experiments  of  a 
platoon-sized  unit  defending  against  a  company  force,  with  deliberate  variations 
In  terrain  and  attacker  tactics.  This  phase  will  examine  Intervisibility,  the 
fire  and  maneuver  interactions.  To  the  extent  possible,  low  visibility  condi¬ 
tions  will  be  considered. 

4.  Gross  Design  of  Test. 

a.  The  conditions  for  the  twenty-four  test  runs  (Individual  battles)  are 
defined  in  Table  1.  Four  major  factors  are  varied:  attacker  tactics,  type  of 
defense,  light  level,  and  terrain  type  (A  or  B).  Note  that  we  will  not  have 
enough  runs  to  Investigate  some  of  the  interactions  that  one  suspects  might 

be  important.  For  example,  the  rapid  approach  tactic  will  not  be  run  In  Type  B 
terrain;  there  will  be  no  night  trials  in  Type  B  terrain;  and  the  rapid  approach 
tactic  will  not  be  run  at  night. 

b.  It  will  be  possible  to  develop  estimates  of  the  impact,  of  some  of  these 

major  factors  on  key  measures  (e.g.,  on  average  attacker  detection  rate).  For 

the  impact  of  "light  level"  we  will  compare  run  set  A  with  run  set  B,  or  set  E 

with  set  F.  For  the  Impact  of  "attacker  tactics"  we  will  compare  set  A  with 
set  D  (or  set  E  with  set  H),  etc.  In  order  to  Increase  sample  sizes  In  some 

cases  we  will  lump  sets  together.  For  example,  we  get  a  sample  of  six  battles 

by  lumping  set  A  with  set  E,  assuming  that  the  differentiation  between  hasty 
defense  and  deliberate  defense  might  not  be  significant  (for  some  measures). 

5.  Quantities  to  te  Measured. 

a.  Before  listing  the  outputs  ultimately  needed  from  the  experiment  we 
will  discuss  the  quantities  that  will  actually  be  measured.  Briefly  we  need 
to  measure  things  like  who  could  ha'  ^  detected  whom  and  when,  who  actually 
detected  whom  and  when  and  why,  who  "killed"  (laser  hits)  whom  and  when,  and 
how  was  information  transmitted  and  used. 

b.  For  each  combat  vehicle  (attacker  as  vehicles  and  defender  vehicles) 
the  test  time-tagged  data  of  the  following  types  will  be  collected: 

(1)  position  location 

(2)  llne-of-sight  (laser  A) 

(3)  detection 

(4)  firing  (laser  B) 

(5)  hit  and/or  kill 

(6)  video  through  the  gunner  sight 

(7)  audio 
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6.  Uncontrolled  Factors. 


In  order  to  approach  tactical  realism  many  Important  factors  (that  could  In¬ 
fluence  the  quantities  measured)  will  not  be  controlled.  However,  to  the 
extent  possible,  the  values  assumed  by  these  uncontrolled  factors  will  be 
measured  or  estimated  during  the  trials  or  recaptured  after  the  trials.  Some 
of  these  uncontrolled  factors  are  listed  In  Table  2.  The  attacker  task  force 
commander  and  the  defender  platoon  contnander,  who  will  be  varied  extensively 
during  the  course  of  the  test,  will  each  be  given  a  broad  mission.  The  details 
of  how  they  carry  out  their  missions  will  be  up  to  them.  Actually  data  on  how 
much  variation  we  observe  between  Individual  commanders  presumably  carrying 
out  the  same  mission  will  also  be  Important  information. 

7.  Major  Test  Outputs. 

We  list  here  only  the  most  Important  measures  (dependent  variables)  expected  to 
be  produced  hy  analysis  of  the  data  collected  during  the  trials.  The  next  step 
would  be  to  correlate  each  of  these  measures  with  the  controlled  variables 
(Table  1)  and  also  with  the  uncontrolled  variables  (Table  2).  (An  example  Is: 
correlation  of  attacker  detection  rate  with  the  force  ratio  -  obtained  by 
dividing  the  number  of  attacker  weapons  ready  by  the  number  of  defender  weapons 
ready.  This  initial  force  ratio  will  usually  vary  from  battle  to  battle,  de¬ 
pending  on  the  states  of  readiness  of  the  Individual  weapons  plus  their  Instru¬ 
mentation  added  on  for  the  test.)  The  major  output  measures  are  as  follows: 

a.  attacker  detection  time  -  conditional  (given  a  detection) 

b.  attacker  detection  time  -  unconditional 

c.  defender  detection  time  -  conditional  (given  a  detection) 

d.  defender  detection  time  -  unconditional 

e.  attacker  engagement  time 

f.  defender  engagement  time 

g.  time  at  least  3  attackers  In  LOS,  etc. 

h.  time  at  least  3  defenders  In  LOS,  etc. 

1.  attacker  fire  distribution  patterns: 

(1)  defender  sites  intervlslble  with  attackers  but  not  engaged  by 
attackers 

(2)  defenders  engaged  "simultaneously"  by  2  attackers,  etc. 

(3)  number  of  rounds  fired  per  engagement 
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j.  defender  fire  distribution  patterns: 

(\)  attackers  intervisible  but  not  engaged  by  defenders 

(2)  attackers  engaged  "simultaneously"  by  2  defenders,  etc. 

(3)  number  of  rounds  fired  per  engagement 

k.  frequency  of  attackers  engaging  false  targets 

l.  frequency  of  defenders  engaging  false  targets 
3.  Additional  Hypot.!.  -ses . 

AUhouah  the  orimary  purpose  of  ARCOMS  Is  to  collect  data  on  detection  rules, 

*+  !>L.  etc.  to  orovide  inputs  for  combined  arms  models  and  war  games, 
tST*S  datt “It  will  be  used  by  TfifeffiA  to  investigate  a  number  of  .tactical 
hypotheses  In  the  areas  of: 

a.  detections  by  attacker 

b.  attacknr  communications 

c  •  packer  control  of  movements 

d.  .'-gradation  or  enhancement 

e.  defender  allocation  of  fire 

f.  defender  disengagements 

These  hypotheses,  after  field  testing,  will  be  either  rejected,  °r 

lified  and  the  accompanying  analyses  will  provide  Insights  that  will  be 
■von  more  valuable  than  model  Inputs.  These  Insights  might  contribute  to 
improvement  In  the  structures  of  the  combat  models  and  to  more  credible 
theories  of  combat. 

9.  Analysis  Procedures. 

The  following  types  of  analyses  will  be  carried  out  with  the  ARCOMS  data: 
o.  Plotting  and  graphics  of  battles 
1;  Serial  correlations 


Dcsf*1.  *>tu  1 1  *-  i  f ■ 


d.  Analysis  <-f 

Theoretical  di; :  r/-,.i  ion  fitting 
Hypotheses  tcstii.r 


248 


g.  Evaluation  of  tactics 

b.  Comparison  of  LOS  data  with  digitized  terrain 

1.  Analysis  of  detection  data  and  model  Improvement  (by  NVEOL) 
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TABLE  1  -  ARCOMS  TEST  RUNS 
(COMPANY  TASK  FORCE  VERSUS  PLATOON  SLICE) 


TABLE  2  -  ARCGMS  -  UNCONTROLLED  FACTORS 


A.  DEFENDER 

1.  Decision  to  move 

2.  Number  of  weapons  ready 

3.  Communications  (target  handoff) 

4.  Frequency  of  firing 

5.  Distribution  of  fire 

6.  Amount  of  concealment 

7.  Open-fire  ranges 

B.  ATTACKER 

1.  Velocity 

a.  Individual  weapons 

b.  platoons 

2.  Specific  movement  patterns  (use  of  terrain,  trees,  etc.) 

3.  Number  of  weapons  ready 

4.  Use  of  overwatchers 

5.  Familiarity  with  terrain 

6.  Conmunl cations  (target  handoff) 

7.  Distribution  of  fire 

8.  Frequency  of  firing 

C.  ENVIRONMENT 

1.  Visibility 

2.  Weather 

3.  Other  obscuration 

4.  Range  (distance) 

5.  Vegetation 

6.  Angle  of  sun 

7.  Target  background 


EXTREME  VALUE  QUANTILE  RESPONSE  EXPERIMENTAL  DESIGN 


Jill  H.  Smith 
Jerry  Thomas 

Probability  and  Statistics  Branch 
Ballistic  Modeling  Division 
U.S.  Army  Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,  Maryland 


ABSTRACT.  An  experimental  design  has  been  developed  to  be  used  to 
determine  the  shielding  thickness  required  between  rounds  stored  in  a 
storage  area  to  prevent  round -to -round  propagation  from  an  initial  explosion. 
Extreme  value  quantile  response  techniques  were  used  with  shielding 
thickness  as  the  stimulus  variable.  The  developed  design  drastically 
reduces  the  sample  size  required  for  a  given  quantile  and  confidence 
when  compared  with  known  distribution-free  extreme  value  designs. 

1.  INTRODUCTION .  The  Terminal  Ballistics  Division  of  the  Ballistic 
Research  Laboratory  encountered  the  problem  of  determining  how  thick  the 
shielding  should  be  between  rounds  of  ammunition  stored  in  a  storage  area 
to  prevent  round-to-round  propagation  from  an  initial  explosion.  Vulnera¬ 
bility  analysis  indicated  that  the  probability  of  survival  of  the  storage 
area  would  drastically  decrease  with  an  increase  in  the  number  of  rounds 
exploding.  Prior  testing  has  shown  that  shielding  material  placed 
between  rounds  could  prevent  neighboring  rounds  from  exploding.  Due  to 
space  limitations  in  the  storage  area,  it  was  desired  to  keep  the  shielding 
thickness  to  a  minimum  and  simultaneously  minimize  the  probability  of 
round-to-round  propagation. 

It  was  decided  that  the  specific  objective  of  the  test  would  be  to 
find  the  shielding  thickness  needed  to  be  90%  confident  that  the  proba¬ 
bility  of  a  neighboring  round  exploding  is  less  than  0.1. 

The  problem  appeared  to  fit  into  the  category  of  extreme  value 
quantile  response  problems.  Defining  X  as  the  stimulus  variable,  in  this 
case  the  thickness  of  the  shielding  which  effects  the  stimulus,  and  the 
probability  of  a  response  associated  with  a  given  X,  x,  is  described 
by  a  nonresponse  function  M(x) .  (Usual  notation  has  M(x)  as  the  probability 
of  response.  However,  defining  M(x)  as  a  nonresponse  is  more  natural  for 
this  problem.)  This  function  is  assumed  to  be  monotonically  nondecreasing 
with  increasing  stimulus  levels. 

A  discussion  of  available  designs  and  the  modified  design  chosen 
for  the  experiment  is  contained  in  the  following  chapters. 

2.  AVAILABLE  DESIGNS.  A  nonparametric  approach  was  taken  because  of 
the  lack  of  information  about  the  response  function.  As  stated,  the 
quantile  in  which  we  are  interested  is  a  ■  .10,  and  therefore  is  in  the  tail 
of  the  response  distribution.  From  a  review  of  the  available  designs  in 
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the  literature  the  only  nonparaaetric  teat  designs  available  for  testing 
in  the  tail  regions  are  the  Alexander  Extreme  Value  Design  and  the 
Rothman  Design.  Of  these,  the  Alexander  Extreme  Value  Design  is  preferred 
since  it: 

1)  is  "generally  more  efficient  than  other  available  nonparaaetric 
designs,  and  is  asymptotically  as  efficient  as  the  best  parametric  stochastic 
approximation  when  distributional  assumptions  are  valid,"1 

2)  has  significantly  simpler  design  rules  and  analysis  procedures 
than  the  Rothman  Design,  and 

3)  does  not  differ  in  median  required  sample  site. 

3.  ALEXANDER  EXTREME  VALUE  DESIGN.  The  Alexander  Extreme  Value 
Design  assumes  only  a  monotone  nondecreasing  response  function  as  the 
stimulus  increases. 

A.  Design  Rules 

1)  The  first  test  is  at  level  (shielding  thickness)  X.,  the 
a  priori  best  guess  of  X  . 

2)  Testing  is  performed  by  alternately  increasing  and  decreasing 
sequences  of  test  levels.  The  test  levels  are  increased  or  decreased  by  a 
step  size  5,  where  6  is  a  fraction  of  an  estimate  of  the  standard  deviation. 
Terms  such  as  "higher"  and  "level  above"  refer  to  thicker  shielding 
levels,  and  "below"  and  "lowest"  refer  respectively  to  thinner  and  thinnest 
shielding  thickness  levels. 

3)  The  first  sequence  decreases  the  levels  until  a  response 
(explosion)  is  observed. 

4)  The  first  test  of  an  increasing  sequence  is  at  the  level  above 
the  highest  level  at  which  a  response  has  been  observed.  The  increasing 
sequence  ends  at  level  X.  such  that  in  the  corresponding  zero  region2  less 
than  or  equal  to  Xj  at  least  N  nonresponses  have  been  observed.  Values 
for  N  can  be  found  from 

(1  -  a)n  •  l  -  P  (2.1) 

where  N  ■  (n)  ♦  1  and  P  is  some  specified  probability. 


D.  Rothman,  M.  J.  Alexander  and  J.  N.  Zimmerman,  The  Design  and  Analysis 
of  Sensitivity  Experiments,  NASA  CR-62026,  Vol.  I,  p.  74. 

2 

Zero  region  -  stimulus  region  above  the  highest  level  at  which  a  response 
has  been  observed. 


5)  The  first  test  of  a  decreasing  sequence  is  at  the  level  above 
the  highest  level  at  which  a  response  has  been  observed.  If  the  result 
is  a  response,  the  sequence  ends;  otherwise,  one  more  test  at  the  next 
lower  level  is  performed. 

6)  Testing  terminates  when  there  are  three  adjacent  levels,  X^, 

X  ♦  6  and  X_  ♦  25  such  that  at  least  one  response  has  been  observed  at  X_ 
and  none  at  a  higher  level,  and  a  total  of  N  nonresponses  have  been 
observed  at  Xy  ♦  5  and  X^,  ♦  25.  (6  is  the  step  size  between  levels.) 

A 

7)  The  maximum  likelihood  estimate  of  XQ,  XQ,  is  found  by  the  method 
of  reversals  and  linear  interpolation  (see  Appendix). 

B.  Analysis 

We  are  interested  in  the  a  ■  .1  quantile  of  the  response  distribution, 
that  is,  the  value,  X  ^  at  which  the  probability  of  a  response  is  .1. 

Therefore,  the  probability  of  a  nonresponse  at  the  X  ^  quantile  is  (1  -  .1). 

The  probability  of  n  nonresponses,  assuming  the  n  tests  are  independent, 

is  (1  -  . l)n.  The  probability  of  at  least  one  response  out  of  n  tests 

is  l  -  (1  -  .l)n.  Specifying  the  probability  of  at  least  one  response 
out  of  n  tests  at  the  X  ^  quantile  to  be  P  *  .9,  we  have 

.9  -  1  -  (1  -  .l)n. 

This,  with  a  slight  algebraic  manipulation,  is  Equation  2.1  with  o  ■  .1 
and  P  ■  .9.  Solving,  N  ■  fn]  ♦  1  ■  22.  Hence,  we  would  expect  with 
probability  .9  at  least  one  response  out  of  22  tests  at  the  X  j  quantile. 

If  we  observe  22  nonresponses  at  some  level  XA,  we  can  assume  we  are  not 
at  the  X  j  quantile  and,  in  fact,  the 

Prob  (X  j  <  X*}  >  .9  . 

Using  the  above  argument,  we  can  conclude  from  the  Alexander  Extreme 
Value  Design  that  the  level  at  which  the  true  probability  of  response 
is  .1  is  less  than  Xr  ♦  25  with  ninety  percent  confidence.  The  point  estimate 

of  the  X  j  quantile  can  be  found  using  the  method  of  reversals  outlined  in  the 

Appendix. 

C.  Simulation 

Based  on  "guestimates"  for  X  s  and  X  75>  a  response  distribution  was  hypothe¬ 
sized  with  which  to  Monte  Carlo  the  Alexander  Extreme  Value  Design  for 
a  ■  .1  and  P  •  .9.  The  response  distribution  assumed  was  the  cumulative 
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normal  distribution  with  mean  •  .S  and  variance  ■  .14.  (Note,  however, 
that  the  test  design  and  analysis  procedures  are  distribution-free.) 

The  smallest  practical  step  size  of  shielding  thickness  was  1/8  inch. 

Figures  1  and  2  are  examples  of  the  Alexander  Extreme  Value  Design 
Monte  Carloed  to  illustrate  the  design  rules.  Responses  are  denoted  by 
"X"  *s  and  nonresponses  by  “O'*  's.  1^  denotes  the  i-th  increasing  sequence 

and  D.  the  j-th  decreasing  sequence.  The  number  of  rounds  required  (NR), 

J  * 

the  maximum  likelihood  estimate  of  the  .1  quantile  (X  ) ,  and  the 

(Xr  ♦  26)  level  are  given  for  each  simulation. 

Figure  3  shows  the  distribution  of  the  number  of  rounds  required 
for  500  simulations  of  the  above  design.  The  number  of  rounds  required 
is  twice  the  number  of  responses  and  nonresponses  shown  for  each  simulation 
since  a  donor  round  must  be  detonated  for  each  test  round.  The  average 
number  of  rounds  required  to  complete  the  test  was  166,  the  median  was 
164  and  ten  percent  of  the  tests  required  184  rounds  or  more. 

The  distribution  of  the  maximum  likelihood  estimates  of  X  .  for 

•  X 

the  S00  simulations  is  given  by  the  histogram  in  Figure  4.  the  distribu- 

A 

tion  of  X  j  is  asymptotically  normal  about  the  true  X  ,  quantile  »  7.83. 

The  distribution  generated  by  the  test  data  shown  in  Figure  4  has  a 
mean  of  7.77,  which  is  in  good  agreement  for  500  simulations,  and  is 
approximately  normally  distributed  as  shown  by  the  overlying  normal  curve. 

Figure  5  shows  the  distribution  of  level  Xr  ♦  26  for  500  simulations. 
This  is  the  level  about  which  we  can  conclude  that  the 

Prob  (X  ,  <  X  ♦  26}  >  .9. 

•  i  r 

4.  MODIFICATION  OF  THE  ALEXANDER  EXTREME  VALUE  DESIGN.  The  median 
number  of  rounds  required  for  the  Alexander  Extreme  Value  Design  (EVD),  as 
described  in  the  previous  section,  was  164  as  determined  by  the  500 
simulations.  Since  the  number  of  rounds  available  for  testing  was  con¬ 
siderably  smaller,  the  major  objective  in  modifying  the  Alexander  EVD  was 
to  reduce  the  number  of  rounds  required,  while  maintaining  the  confidence 
level  and  the  ability  to  compute  the  point  estimate  of  the  X  .  quantile. 

The  Alexander  EVD  requires  that  a  donor  round  be  detonated  for  each 
test.  The  number  of  donors  needed  can  be  reduced  by  using  one  donor  to 
detonate  up  to  four  test  rounds  (acceptors) .  Figure  6  shows  the  config¬ 
uration  of  four  acceptors  per  donor.  Steel  shielding  will  be  placed 
between  acceptors,  as  shown  by  the  dotted  lines,  if  interaction  between 
acceptors  is  observed.  Optimizing  the  number  of  acceptors  per  donor  in 
the  Alexander  EVD  reduces  the  number  of  rounds  required  by  approximately 
33  percent. 


256 


It  was  noticed  that  the  rounds  above  level  Xf  ♦  25  were  neither  used  to 

establish  the  confidence  statement,  nor  to  terminate  the  test  design,  nor  to 
compute  the  point  estimate  of  the  X  j  quantile.  By  limiting  each  increasing 

sequence  above  the  highest  stimulus  level  at  which  a  response  Has  been 
observed,  the  rounds  "wasted"  above  level  Xr  ♦  25  can  be  eliminated.  There 

is  a  trade-off  in  eliminating  these  rounds  since  the  level  X*  *  25  can 

change  if  a  response  is  observed  at  a  higher  level.  Therefore,  some  testing 
should  be  above  Xr  ♦  25  until  more  than  half  the  number  of  rounds  required 

to  demonstrate  the  chosen  probability  are  at  levels  Xf  ♦  5  and  X^  ♦  26. 

Testing  at  X  and  below  is  used  in  the  determination  of  the  point  estimate 
of  X<1. 

The  following  test  design  is  the  result  of  many  Monte-Carlo  simula¬ 
tions  in  which  different  starting  levels,  number  of  acceptors  per  donor 
and  sequences  of  testing  have  been  tried  in  order  to  minimize  the  required 
number  of  rounds,  yet  retain  the  confidence  level  and  point  estimate  of 
the  quantile  X 

A.  Modified  Design  Rules 

1)  The  first  test  level  is  Xj,  the  best  a  priori  guess  of  X  . 

6  is  the  step  size  between  levels."1  “ 

2)  One  acceptor  per  donor  is  used,  in  a  decreasing  sequence,  until  a 
response  is  observed.  Let  X  be  the  highest  level  at  which  a  response 

is  observed. 

3)  After  the  first  response,  the  number  of  acceptors  per  donor 

in  each  test  is  increased  to  alternately  three  and  then  four.  After  the 
first  response,  three  acceptors  per  donor  are  tested  at  the  next  three  levels 
above  X  .  Then  four  acceptors  per  donor  having  shielding  at  levels  X 
and  the  next  three  higher  levels  are  tested.  r 

4)  If  another  response  is  observed  at  a  higher  level,  it  becomes 
X  ,  and  testing  continues  alternating  three  and  then  four  acceptors  per 
donor  until  at  least  12  (more  than  half  the  required  22)  nonresponses 
have  been  observed  at  the  two  levels  immediately  above  Xr. 

5)  When  at  least  12  nonresponses  have  occured  at  Xr  ♦  5  and  Xf  ♦  26, 

the  number  of  acceptors  per  donor  is  reduced  to  alternately  two  above  X  and 
then  three,  starting  at  Xr,  for  the  remainder  of  the  test.  r 

6)  Testing  terminates  when  at  least  N  (22)  nonresponses  have  been 
observed  at  the  two  levels  immediately  above  the  highest  level  at  which 
a  response  has  been  observed. 


ft.  Analysis  of  the  Modified  Design 

As  in  the  Alexander  Extreme  Value  Design,  we  have  N  ■  22  nonresponses 

at  X  ♦  6  and  X.  *  26  and  can  conclude  that  we  are  not  at  level  the  X  . 
r  r  •* 

quantile  and  in  fact, 

pTOb  {X  .  <  X„  ♦  26)  >  .9. 

The  point  estimate  can  again  be  found  using  the  method  of  reversals.  There¬ 
fore,  the  changes  in  the  test  design  have  not  affected  the  confidence  statement 
or  the  point  estimate. 

C.  Simulations  Using  Modified  Design 

Using  the  same  response  function  that  was  used  when  simulating  the 
Alexander  Extreme  Value  Design,  SOO  simulations  of  the  modified  design 
were  also  Monte-Carl oed. 

Figures  7  and  8  are  examples  of  the  modified  test  design  illustrating 
the  modified  design  rules.  Again,  responses  are  denoted  by  "X"  's  and 
nonresponses  by  "0"  ' s.  The  abscissa  represents  individual  tests  rather  than 
sequences  of  tests  as  shown  in  the  Alexander  Extreme  Value  Design. 

Figure  9  shows  the  distribution  of  the  required  number  of  rounds  for 
the  SOO  simulations.  The  median  number  of  rounds  required  was  67  and  the 
mean  number  of  rounds,  70.  Only  ten  percent  of  the  simulations  required 
93  or  more  rounds. 

The  histogram  in  Figure  10  is  the  distribution  of  the  maximum 
likelihood  estimates  of  X  j  for  the  SOO  simulations.  Again,  the  distri¬ 
bution  of  the  maximum  likelihood  estimates  are  asymptotically  normal 
about  the  true  X  .  quantile  ■  7.83.  The  distribution  shown  has  a  mean  of 
8.00,  and  is  approximately  normally  distributed  as  shown  by  the  overlying 
normal  curve.  Figure  12  shows  the  distribution  of  number  of  rounds  required 
for  both  the  Alexander  EVD  and  the  Modified  Alexander  EVD.  The  Modified 
Alexander  EVD  is  on  the  left  and  the  Alexander  EVD  is  on  the  right. 


5.  SUMMARY.  The  Alexander  EVD  was  modified,  mainly,  by  using  multiple 
rounds  per  test  and  by  limiting  the  number  of  rounds  above  the  highest 
response.  These  changes  resulted  in  a  design  that  required  less  than  half 
the  rounds  of  the  Alexander  EVD  in  the  simulations  performed.  The  range  of 
the  required  number  of  rounds  (NR)  for  the  Alexander  EVD  was  from  134  to  2S6 
and  for  the  modified  Alexander  EVD  was  46  to  140.  The  Modified  Alexander 
EVD  has  simple  design  rules  that  permit  the  estimation  of  an  extreme 
value  of  a  quantile  response  function  and  the  associated  confidence  interval. 

This  report  used  only  the  normal  distribution  as  the  assumed  underlying 
distribution  for  the  Monte  Carlo  simulations.  Other  distributions  are  currently 
being  used  for  this  purpose.  A  reduction  in  the  number  of  rounds  required 
for  these  distributions  is  also  expected.  Recall,  however,  that  neither 
the  experimental  design  nor  the  analysis  methods  require  the  assumption  of 
a  response  distribution.  The  design  is  distribution-free. 


APPENDIX 


METHOD  OP  REVERSALS  FOR  SENSITIVITY  DATA 


1.  METHOD.  The  method  of  reversals  is  s  maximum* likelihood  procedure 
for  obtaining" 'di str ibut ion- free  estimates  of  a  monotone  nondecreasing 
response  function.  The  test  stimulus  levels  are  X.(i  ■  l,2,...,k)  and 
are  ordered  from  thickest  to  thinnest  shielding  thickness, 

Xj  >  X2  >  ...  >  Xk  (A.l) 

A 

If  is  the  estimate  of  the  probability  of  response  at  Xj,  and  if  we 

assume  that  the  response  function  is  monotone  nondecreasing,  then  neces¬ 
sarily 

A  A  A 

Pj  <  p2  <  ...  <  pk.  (A. 2) 

The  algorithm  below  can  be  used  to  find  the  estimates  of  the  response  dis¬ 
tribution  and  their  associated  stimulus  levels. 


1)  Let  X,  (i  *  l,2,...,k)  be  the  k  stimulus  levels  at  which  data  have  been 

1  A 

collected,  where  Xj  >  X2  ...  -  X^.  We  wish  to  find  the  estimates,  p^,  of 
the  values  ■  M[X^) ,  the  response  probabilities  at  the  levels  X^,  which 
satisfy  Equation  A. 2. 

2)  Let  n^  (i  •  l,2,...,k)  be  the  number  of  tests  performed  at  level  Xj^  and 
fj  (i  ■  l,2,...,k)  be  the  number  of  responses  observed  in  the  n^  tests. 
Consider  the  sequence 


If  this  sequence  is  nondecreasing,  then  the  estimates  p,  are  simply  given 
by 


i  i*l 

3)  If  for  some  i,  — -  >  - -  ,  replace  both  by 

ni  Vl 


F 


1,1+1 

i»i+l 


fi  -  fi+l 
ni  4  ni+l 
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The  new  sequence  is  then 


If  this  sequence  still  contsins  a  Teversel,  a  pair  of  consecutive  fractions 
for  which  the  first  is  greater  than  the  second,  replace  the  pair  with  a 
single  term  as  above.  This  process  is  continued  until  one  obtains  a  non¬ 
decreasing  sequence: 


♦l  *2  .*3 

V  V  ^3 . 


♦ ,  fA  ♦  . . .  ♦  £k 

where  «  — - — 

nj  ni*-*Vs 


for  appropriate  i  and  s. 


4.  The  final  estimates  are  given  by 

*  A 
pi . • 

5.  Linear  interpolation  is  used  to  compute  the  values  of  the  response 
function  between  stimulus  levels  tested. 


2.  EXAMPLE.  If  the  results  of  the  experiment  were  as  shown  in  Figure  A1 
the  max imua  likel ihood  estimate  found  by  the  method  of  reversals  is  as 
follows: 


Shielding  Thickness 

fi/ni 

Fi,i->1 

u 

*1 

ft 

(Inches) 

J 

4/8 

1/1 

1.0 

S/8 

0/2  n 

.s 

6/8 

1/2  J 

° 

.3 

7/8 

2/6 

J 

.3 

8/8 

1/9 

.11 

9/8 

0/12 

0 

'0/8 

0/10 

0 

The  shielding  thickness  corresponding  to  the  .1  quantile  is  found  by 
linear  interpolation. 


The  shielding  thickness  associated  with  the  .1  quantile  is  1.01  inches. 
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The  Rank  Transformation  as  a  Robust 
and  Powerful  Tool  for  the  Analysis  of  Experimental  Data 

W.  J.  Conover,  Texas  Tech  University 


Abstract 


Rank  Transformation  procedures  are  ones  In  which  the  usual  parametric  proce¬ 
dure  is  applied  to  the  ranks  of  the  data  Instead  of  to  the  data  themselves.  In 
the  one  way  layout  the  rank  transformation  procedure  Is  equivalent  to  the 
Kruskal-Wallls  test.  Simulation  results  using  various  distributions  show  that 
this  procedure  tends  to  have  more  power  than  either  the  F  test  or  Fisher's  ran¬ 
domization  test,  a  well  known  nonparametrlc  procedure. 

The  rank  transformation  procedure  for  the  two  way  layout  is  compared  with 
the  F  test  and  Fisher's  randomization  test  under  normality  and  several  types  of 
nonnormality.  Overall  the  rank  transformation  procedure  seems  to  be  the  best. 

The  Fisher's  LSD  multiple  comparisons  procedure  in  the  one  way  and  two  way 
layouts  Is  compared  with  a  randomization  procedure  and  with  the  same  procedure 
computed  on  ranks.  In  nonnormal  situations  the  rank  transformation  procedure 
appears  to  maintain  power  better  than  Fisher's  LSD  or  the  randomization  proce¬ 
dures.  The  conclusion  of  this  study  is  that  the  rank  transformation  provides  a 
reasonable  alternative  to  the  usual  analysis  of  experimental  designs. 
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1.  INTRODUCTION 


Three  methods  for  analyzing  experimental  data  are  compared  In  this  study. 

The  first  Is  the  standard  analysis  of  variance  procedure  based  on  the  assumption 
of  normality  and  assumed  to  be  robust  In  most  situations  encountered  In  prac¬ 
tical  applications.  The  second  Is  a  randomization  procedure  attributed  to  R.A. 
Fisher  (1935),  which  Is  known  to  be  "most  efficient"  In  some  sense,  and  Is 
assumed  by  many  practitioners  to  be  the  best  test  one  could  possibly  use, 
although  It  Is  difficult  to  use  even  with  a  computer.  The  third  procedure 
Involves  a  rank  transformation  of  the  data  prior  to  the  application  of  the  first 
procedure,  that  Is  It  Is  an  analysis  of  variance  on  the  ranks. 

These  three  procedures  are  compared  in  a  completely  randomized  design  (a 
one-way  layout)  and  In  a  randomized  block  design.  Other  designs  could  just  as 
easily  have  been  selected  for  comparison,  but  the  randomization  test  Involves 
such  extensive  computer  time  that  only  a  limited  study  Is  possible.  The  robust¬ 
ness  of  all  three  procedures  Is  estimated  under  the  null  hypothesis  by  computer 
simulation,  and  the  power  Is  estimated  under  the  assumed  existence  of  treatment 
effects,  also  by  computer  simulation.  A  multiple  comparisons  procedure  Is  used 
whenever  the  null  hypothesis  Is  rejected,  and  comparisons  of  the  three  multiple 
comparisons  procedures  are  made  also. 

These  results  were  obtained  by  Ronald  L.  Iman  of  Sandla  National 
Laboratories  In  some  joint  research  work  with  the  author.  More  extensive 
results  appear  In  the  unpublished  manuscripts  by  Iman  and  Conover  (1980a  and 
1980b)  and  by  Conover  and  Iman  (1980). 
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The  Rank  Transformation  as  a  Robust 
and  Powerful  Tool  for  the  Analysis  of  Experimental  Data 

W.  J.  Conover,  Texas  Tech  University 

Abstract 

Rank  Transformation  procedures  are  ones  In  which  the  usual  parametric  proce¬ 
dure  is  applied  to  the  ranks  of  the  data  Instead  of  to  the  data  themselves.  In 
the  one  way  layout  the  rank  transformation  procedure  Is  equivalent  to  the 
Kruskal-Wallis  test.  Simulation  results  using  various  distributions  show  that 
this  procedure  tends  to  have  more  power  than  either  the  F  test  or  Fisher's  ran¬ 
domization  test,  a  well  known  nonparametrlc  procedure. 

The  rank  transformation  procedure  for  the  two  way  layout  is  compared  with 
the  F  test  and  Fisher's  randomization  test  under  normality  and  several  types  of 
nonnormal Ity.  Overall  the  rank  transformation  procedure  seems  to  be  the  best. 

The  fisher's  ISO  multiple  comparisons  procedure  In  the  one  way  and  two  way 
layouts  1 s  compared  with  a  randomization  procedure  and  with  the  same  procedure 
computed  or.  ranks.  In  nonnormal  situations  the  rank  transformation  procedure 
appears  to  maintain  power  better  than  Fisher's  LSD  or  the  randomization  proce¬ 
dures.  The  conclusion  of  this  study  Is  that  the  rank  transformation  provides  a 
reasonable  alternative  to  the  usual  analysis  of  experimental  designs. 


2.  THE  COMPLETELY  RANOOMIZEO  DESIGN 


Let  Xjj,  l<1<nj,  Uj£k  be  random  variables  representing  the  1th  observation 
In  treatment  j  in  a  completely  randomized  design.  Let  7.j  and  7,,  represent  the 
sample  treatment  mean  and  the  overall  mean  respectively.  The  F  statistic  Is 
given  by 


(N-K)  Inj.(7j  -7..)2 

F  -  - i - 

(k-1)  II  (Xu  -7j)2 


(2.1) 


Where  N  »  I  n<  Is  the  total  sample  size.  The  F  test  compares  the  F  statistic 
with  the  F  distribution,  k-1  and  n-k  degrees  of  freedom,  and  rejects  the  null 
hypothesis  of  equal  treatment  means  If  F  Is  In  the  upper  a  tall  of  the  F 
distribution.  Such  a  test  Is  exact  under  assumptions  of  Identical  normal 
distributions,  but  Is  robust  even  for  some  nonnormal  distributions.  If  the  null 
hypothesis  Is  rejected,  Fisher's  LSD  procedure  Is  used  to  declare  treatments 
jj  and  j2  significantly  different  when  the  Inequality 


l*.Jj  -*.J2|  >  t  «/2,N-k 


"Jl 


ni2 


Is  satisfied,  where 


(2.2) 


MSE 


-  &  II  (XU  ’  *-J)2 


1  J 


(2.3) 


and  where  tptin  Is  the  (1-p)  quantile  from  a  student's  t  distribution  with  m 
degrees  of  freedom. 

For  Fisher's  randomization  test  the  F  statistic  from  Equation  (2.1)  Is  com¬ 
pared  with  the  distribution  of  all  possible  F  statistics  arising  from  the  N l / 

n  (nji  ways  the  same  N  observations  can  be  partitioned  Into  k  groups  of  size 
j  J 

nj  each,  J»l,  ....  k.  In  practice,  even  with  high  speed  computers  and  moderate 


277 


sample  sizes  the  total  number  of  combinations  Is  too  large  to  handle,  so  the 
suggestion  of  Owass  (1957)  Is  followed.  That  Is,  a  random  subset  of  the  total 
number  possible  Is  used  to  obtain  an  unbiased  and  consistent  estimate  of  the 
distribution  function  of  the  randomization  statistic.  In  this  paper  k*4  and 
(nlf  n2»  nj,  04)  Is  (7,  8,  9,  10).  A  subset  of  1000  partitions,  out  of  the  more 
than  10*8  partitions  possible,  was  used  to  estimate  alphahat. 

Whenever  the  alphahat  was  5%  or  less  multiple  comparisons  were  made  using  a 
procedure  similar  to  that  described  above,  only  restricting  the  permutations  to 
the  ways  the  observations  In  the  two  samples  being  compared  can  be  partitioned. 
Here  again,  only  1000  of  the  possible  permutations  were  used  for  each 
comparison.  The  treatments  were  considered  significantly  different  If  the 
observed  value  of  j  |  was  among  the  largest  5%  obtained. 

The  third  test  consists  of  replacing  data  by  the  ranks  from  1  to  N,  and  per¬ 
forming  an  F  test  on  the  ranks.  This  Is  equivalent  to  the  Kruskal-Wall Is  test. 
Multiple  comparisons  were  made  by  computing  (2.2),  as  in  the  Fisher  LSD 
procedure,  but  using  the  same  ranks  used  above  instead  of  the  data.  These  three 
procedures  are  called  the  F,  R  and  RT  methods  respectively. 

Comparisons  of  these  three  tests  were  made  for  three  population  distribu¬ 
tions,  the  normal,  lognormal  and  exponential  distributions.  The  null  case  was 
examined,  along  with  three  non-null  settings  corresponding  to  slight,  medium, 
and  strong  differences  In  treatment  effects.  The  parameters  used  are  sum¬ 
marized  In  Table  1. 

In  each  of  these  12  combinations  of  distributions  with  treatment  effects, 

500  replications  were  made  to  compare  the  robustness  and  power  of  the  three 
tests.  These  results  are  given  In  Table  2.  They  show  that  the  Fisher  ran¬ 
domization  test  and  the  rank  transform  test  are  robust  for  all  three  distribu¬ 
tions,  as  expected  because  they  are  both  nonparametrlc  procedures.  The  F  test 
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TABLE  1.  The  population  effects  used  In  the  completely  randomized  design  simu¬ 
lation  study:  Means  of  the  normal  (o‘  .  4),  means  of  the  log  of  the 
lognormal  ( «  2  0f  logs  *  4),  means  of  the  exponential. 


EFFECTS 

NORMAL 

LOGNORMAL 

EXPONENTIAL 

Null 

(0,0,0,0) 

(0,0,0, 0) 

(1,1, 1,1) 

Slight 

(0,0, 0,1) 

(0, 0,0,1) 

(2,2, 2, 3) 

Medium 

(0,0, 1,2) 

(0,0, 1,2) 

(1. 1,2,3) 

Strong 

(0,1, 2, 3) 

(0,1, 2, 3) 

(1,2, 3,4) 

TABLE  2.  The  percent  of  time  the 
randomized  design,  four 

null  hypothesis  was 
treatments,  nj»7,  ng 

rejected  in  the  complete 
a8,  n339,  n4**10. 

cFFECTS 

NORMAL 

LOGNORMAL 

EXPONENTIAL 

Error  Rate 

R  F  RT 

R 

F 

RT 

R  F 

RI 

in  Null  Case: 

5%  5%  5% 

6% 

2% 

7% 

4%  4% 

5% 

Power  Under 

Slight  Effects: 

19%  19%  19% 

9% 

4% 

17% 

12%  10% 

12% 

Medium  Effects: 

52%  52%  49% 

18% 

12% 

43% 

47%  40% 

46% 

Strong  Effects: 

72%  72%  70% 

22% 

13% 

69% 

43%  37% 

53% 

on  the  other  hand  is  robust  for  the  normal  and  exponential  distributions,  but 
quite  conservative  for  the  lognormal  distribution.  The  conservative  nature  of 
the  F  test  carries  over  to  Inhibit  its  power  for  detecting  differences  In 
lognormal  distributions.  The  rank  transform  procedure  shows  the  most  power  In 
the  lognormal  and  exponential  cases,  and  about  the  same  power  as  the  other  two 
procedures  when  the  distributions  are  normal. 

When  the  null  hypothesis  was  rejected  using  the  previous  procedures,  the 
corresponding  multiple  comparisons  tests  were  made  as  previously  described.  The 
results,  summarized  in  Table  3,  show  the  same  types  of  results  as  In  Table  2. 


* 
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TABLE  3.  The  number  of  times  treatment  pairs  were  declared  significantly  dif¬ 
ferent  In  500  simulations,  using  CR  design  with  4  treatments,  ni  ■  7, 

A  A  I  M  * 


r\2  ■  8,  r»3  ■ 

9,  rty 

-  10. 

TREATMENT 

NORMAL 

LOGNORMAL 

EXPONENTIAL 

EFFECTS 

R 

P 

“  RT 

R  ' 

F~ 

Tr 

r~ 

F 

~tt 

Null  1,2* 

6 

6 

7 

10 

9 

8 

6 

8 

9 

1,3* 

5 

6 

5 

8 

7 

7 

5 

9 

9 

2,3* 

6 

4 

5 

7 

7 

12 

10 

10 

10 

1,4* 

4 

4 

3 

16 

5 

9 

4 

4 

8 

2,4* 

5 

5 

4 

10 

6 

9 

3 

5 

7 

2,4* 

6 

6 

5 

4 

3 

6 

6 

5 

11 

Slight  1,2* 

17 

21 

17 

3 

5 

17 

7 

7 

13 

1,3* 

14 

19 

12 

4 

4 

13 

10 

7 

12 

2,3* 

21 

23 

20 

4 

2 

22 

7 

4 

15 

1,4 

54 

53 

51 

11 

20 

57 

20 

24 

23 

2,4 

56 

53 

52 

11 

18 

57 

20 

28 

15 

3,4 

48 

48 

54 

16 

18 

51 

30 

34 

24 

Medium  1,2* 

16 

24 

19 

10 

0 

20 

9 

0 

13 

1,3 

63 

73 

77 

8 

4 

71 

56 

37 

93 

2,3 

81 

78 

83 

15 

5 

61 

75 

42 

89 

1,4 

189 

200 

198 

26 

38 

184 

143 

174 

186 

2,4 

206 

210 

203 

45 

36 

180 

172 

180 

185 

3,4 

81 

79 

75 

36 

36 

86 

54 

86 

63 

Strong  1,2 

69 

77 

64 

9 

0 

60 

40 

8 

64 

1,3 

207 

220 

209 

22 

4 

210 

86 

56 

146 

2,3 

64 

80 

88 

14 

4 

90 

36 

27 

49 

1,4 

332 

338 

339 

49 

59 

328 

148 

137 

208 

2,4 

220 

236 

235 

51 

58 

233 

84 

108 

95 

3,4 

79 

81 

76 

39 

49 

87 

36 

76 

38 

Simple  totals: 
Identical  populations 

100 

118 

97 

76 

48 

123 

67 

59 

107 

Some  effects  present 

1749 

1826 

1804 

362 

349 

1775 

1000 

1017 

1278 

*These  populations  are 

Identical. 

That  Is,  the  LSO  procedure  on  the  ranks  has  more  overall  power  to  detect 
differences  where  they  exist  than  the  other  two  types  of  procedures  do.  In 
summary,  for  the  CR  design  the  transformation  to  ranks  prior  to  the  usual  analy¬ 
sis  improves  the  robustness  and  power  of  the  usual  analysis  In  nonnormal 
situations  without  losing  much  of  the  fine  qualities  of  the  usual  analysis  In 
the  normal  situation. 
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3.  THE  RANDOMIZED  COMPLETE  SLOCK  DESIGN 


Let  X^j,  for  Kl£b  and  l<j<k,  be  random  variables  associated  with  the  1th 
block  and  the  jtti  treatment,  and  let  7^.,  7,j  and  7t,  be  the  sample  block, 
treatment  and  grand  means  respectively.  The  F  statistic  Is  given  by 


b(b-l)  l  (Xtj  -  X,.)? 

\  ]  <*1j-T.j )2 


(3.1) 


The  parametric  F  test  compares  the  F  statistic  with  quantiles  of  the  F  distribu¬ 
tion  with  k-1  and  (b-1)  (k-1)  degrees  of  freedom.  These  quantiles  are  exact 
under  normality,  additivity,  and  equal  variances,  and  are  reasonable  approxima¬ 
tions  under  mild  violations  of  the  normality  assumption.  If  the  F  statistic  Is 
In  the  upper  tail  of  the  F  distribution,  the  null  hypothesis  of  equal  treat¬ 
ment  means  Is  rejected,  and  multiple  comparisons  are  made.  Treatments  j^  and 
$2  are  declared  significantly  different  If  the  Inequality 


Kjj  >  t  <,/2>  (6.1,  (K.,)  /  2(SSE)/(b(b-l)  (k-1))  (3.2) 

Is  satisfied,  where  SSE  is  the  denominator  of  Equation  (3.1),  and  where  tp>m  Is 
the  pth  quantile  from  a  t  distribution  with  m  degrees  of  freedom.  This  Is  the 
well  known  Fisher's  LSD  procedure. 

For  Fisher's  randomization  test,  as  presented  by  Welch  (1937)  and  Pitman 
(1938),  the  F  statistic  Is  used,  but  not  the  F  distribution.  The  F  statistic  Is 
computed  for  each  of  the  (k!)b  configurations  of  the  observations,  obtained  by 
permuting  the  observations  within  blocks.  If  the  observed  F  statistic  Is  one  of 
the  (k!)b.  a  largest  of  these,  the  null  hypothesis  Is  rejected.  In  this  study 
k*3  and  b»5,  so  the  (3!)5(.05)  »  384  largest  values  of  F  constitute  the  critical 
region.  (The  actual  value  388.8  Is  rounded  down  to  the  first  multiple  of  6, 
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because  the  configurations  appear  In  a  multiplicity  of  6  and  the  alpha  level 
should  be  <  .05.)  The  actual  number  of  possible  F  values  greater  than  or  equal 
to  the  observed  F  value  Is  divided  by  (k!)b  to  obtain  "alphabet,"  sometimes 
known  as  the  p  value  or  the  critical  level. 

If  alphahat  Is  less  than  or  equal  to  .05,  multiple  comparisons  are  made  by 
permuting  only  those  observations  In  the  treatment  pair  being  considered. 

Because  there  are  effectively  only  (21)5  „  32  different  permutations,  the  treat¬ 
ment  pair  Is  declared  significantly  different  It'  and  only  if  all  pairwise  dif¬ 
ferences  have  the  same  sign,  at  a  level  of  significance  2/32  *  .0625.  For 
comparison  purposes  these  same  values  of  .05  and  .0625  were  used  In  the  F  test 
described  previously  and  In  the  following  test. 

The  third  test  Is  a  rank  transform  procedure  found  by  Iman  (1974)  and 
Conover  and  Iman  (1976)  to  have  good  properties  of  power  and  robustness  In  ran¬ 
domized  block  designs.  First  al1  b*k  observations  are  replaced  by  their  ranks 
from  1  to  b*k.  The  F  statistic  of  Equation  (3.1)  Is  computed  on  these  ranks  and 
compared  with  the  F  distribution,  k-1  and  (b-1)  (k-1)  degrees  of  freedom  as  an 
approximation  procedure,  just  as  In  the  first  method  described.  Multiple  com¬ 
parisons  are  made  using  Equation  (3.2)  just  as  In  the  parametric  case,  but  using 
the  same  ranks  used  above  rather  than  reranking  each  pair  of  samples  In  a 
Mann-Whltney  fashion. 

This  study  examines  normal,  lognormal  and  exponential  distributions,  under 
the  null  case  and  with  slight,  medium  and  strong  treatment  effects.  Under  each 
of  these  12  population-treatment  combinations  500  replications  were  made,  and 
the  three  tests  conducted.  Thus  6000  computations  of  the  F  test  (F)  and  the 
rank  transform  procedure  (RT)  were  made,  and  7,776,000  F  statistics  were  com¬ 
puted  for  the  randomization  test  as  a  different  null  distribution  must  be  found 
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TABLE  4.  The  population  effects  present  In  the  simulation  study:  means  of  the 
normal  (o*  «  4),  means  of  the  log  of  the  lognormal  ( er2  of  logs  ■  4), 
means  of  the  exponential.  Add  block  effects  (1,  2,  3,  4,  5)  to  the 


EFFECTS 

means  In  the  five  blocks. 

NORMAL 

LOGNORMAL 

EXPONENTIAL 

Null 

(0,0,0) 

(0.0,0) 

(0.0,0) 

Slight 

(0,0,1) 

(0,0,1) 

(0,0,1) 

Medium 

(0,1,2) 

(0,1,2) 

(0,4,6) 

Strong 

(0,1,3) 

(0,1,3) 

(0,7,9) 

In  each  case.  Specific  values  of  the  parameters  used  are  listed  In  Table  4. 

The  results  of  the  three  tests  are  summarized  in  Table  5  for  the  twelve 
situations  described  In  Table  4.  The  results  are  similar  to  the  results  for  CR 
designs  presented  in  the  previous  chapter.  That  is,  the  usual  F  test  on  the 
ranks  has  better  robustness  and  power  In  the  nonnormal  cases  examined  than  the  F 
test  on  the  data,  and  essentially  the  same  robustness  and  power  In  the  normal 
situation.  The  randomization  procedure  has  power  somewhere  between  the  power  of 
the  other  two  tests. 

TABLE  5.  The  percent  of  time  the  null  hypothesis  was  rejected  in  the  randomized 
complete  blocks  design,  five  blocks,  three  treatments. 

EFFECTS  NORMAL  LOGNORMAL  EXPONENTIAL 


R 

F 

RT 

R 

F 

RT 

R 

F 

RT 

Error  Rate 

In  Null  Case: 

Power  Under 

5% 

5% 

5% 

5% 

1% 

5% 

5% 

3% 

4% 

Slight  Effects: 

10% 

10% 

10% 

8% 

1% 

8% 

10% 

7% 

10% 

Medium  Effects: 

22% 

22% 

21% 

15% 

2% 

23% 

18% 

12% 

23% 

Strong  Effects: 

42% 

42% 

42% 

34% 

4% 

41% 

20% 

17% 

?7% 
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TABLE  6.  The  number  of  times  treatment  pairs  were  declared  significantly  dif¬ 
ferent  In  500  simulations,  using  an  RCB  design  with  three  treatments, 
five  blocks,  one  observation  per  cell. 


TREATMENT 
EFFECTS  "MITT"' 


R 


NORMAL 


DISTRIBUTION 


EXPONENTIAL 


NULL 

1,2* 

11 

14 

17 

9 

2 

13 

9 

8 

13 

2,3* 

12 

12 

18 

9 

4 

15 

12 

9 

13 

1,3* 

7 

11 

13 

12 

4 

18 

12 

10 

10 

SLIGHT 

1.2* 

15 

16 

21 

3 

0 

18 

4 

4 

10 

2.3 

34 

41 

42 

17 

4 

30 

18 

19 

23 

1,3 

29 

36 

33 

18 

4 

27 

11 

17 

23 

MEDIUM 

1,2 

41 

51 

52 

16 

2 

50 

37 

17 

81 

2,3 

37 

48 

50 

37 

6 

53 

26 

42 

37 

1,3 

74 

97 

96 

61 

5 

110 

60 

53 

109 

STRONG 

1,2 

51 

64 

65 

15 

0 

56 

51 

45 

106 

2,3 

96 

136 

135 

105 

20 

117 

22 

40 

27 

1,3 

163 

204 

204 

117 

19 

198 

64 

57 

118 

SIMPLE 

TOTALS: 

IOENTICAL 

POPULATIONS 

45 

53 

69 

33 

10 

64 

37 

31 

46 

SOME  EFFECTS 

PRESENT 

525 

677 

677 

386 

60 

641 

289 

290 

524 

♦These  populations  are  Identical. 

Multiple  comparisons  were  made  when  the  null  hypothesis  was  rejected 
using  the  previous  tests.  The  multiple  comparisons  results  given  In  Table  6  are 
similar  to  the  results  obtained  for  the  CR  design  In  the  previous  section. 
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Overall  the  rank  transformation  allows  more  real  differences  to  be  detected  tnan 
wher  either  of  the  other  two  procedures  Is  used. 

4.  CONCLUSIONS 

The  usual  F  test,  followed  by  Fisher's  LSD  procedure  for  multiple 
comparisons,  shows  approximately  the  same  robustness  and  power  as  Fisher's  ran¬ 
domization  test  and  the  rank  transform  procedure  when  the  populations  are 
normal,  slightly  less  power  than  the  other  two  procedures  with  exponential 
distributions,  and  considerably  less  power  than  the  other  two  procedures  when 
the  distributions  are  lognormal.  This  latter  result  may  be  due  In  part  to  the 
extreme  conservative  nature  of  the  parametric  procedure  under  the  lognormal 
distribution,  or  It  may  be  due  in  part  to  the  nonhomogeneity  of  variances  In  the 
models  considered.  Nonhomogeneity  of  variances  Is  a  natural  consequence  of 
positive  valued  data  when  the  means  are  different.  It  occurs  often  in  actual 
data  analysis,  so  no  attempt  was  made  to  alter  the  situation  In  this  study 
either. 

Fisher's  randomization  test  Is  a  difficult  and  time  consuming  procedure  to 
use  in  experimental  designs.  This  study  indicates  that  the  extra  work  required 
is  probably  not  justified,  because  while  Fisher's  randomization  test  shows 
better  power  and  robustness  overall  than  the  F  test  on  the  untransformed  data, 
it  compares  unfavorably  with  the  F  test  on  the  ranks  of  the  data. 

The  F  test  on  the  ranks  of  the  data,  with  the  subsequent  LSD  procedure  on 
the  ranks,  Is  an  easy  procedure  to  use.  It  has  essentially  the  same  power  as 
the  F  test  In  normal  situations,  and  more  power  than  either  the  F  test  or 
Fisher's  randomization  test  when  populations  are  lognormal  or  exponential,  at 
least  In  the  cases  studied. 


285 


REFERENCES 


Conover,  W.  J.  and  Iman,  R.  L.  (1976).  On  some  alternative  procedures  using 
ranks  for  the  analysis  of  experimental  designs.  Comm.  In  Statist.  -  Theo.  and 
Meth.,  A 5,  1349-1368. 

Conover,  W.  J.  and  Iman,  Ronald  L.  (1980).  Small  sample  efficiency  of  Fisher's 
randomization  test  when  applied  to  experimental  designs.  Presented  at  the 
National  Meeting  of  the  American  Statistical  Association,  Houston,  August. 

Owass,  Meyer  (1957).  Modified  randomization  tests  for  nonparametrlc  hypotheses. 
The  Annals  of  Mathematical  Statistics,  28,  pp.  181-187. 

Fisher,  R.  A.  (1935).  The  Oeslgn  of  Experiments.  New  York:  Hafner. 

Iman,  R.  1.  (1974).  A  power  study  of  a  rank  transform  for  the  two-way  classifi¬ 
cation  model  when  interaction  may  be  present.  Canad.  0.  of  Statist.  Sect.  C: 
Appl . ,  2,  pp.  227-239. 

Iman,  Ronald  L.  and  Conover,  W.  J.  (1980a).  Multiple  comparisons  procedures 
based  on  the  rank  transformation.  Presented  at  the  National  Meeting  of  the 
American  Statistical  Association,  Houston,  August. 

Iman,  Ronald  L.  and  Conover,  W.  J.  (1980b).  A  comparison  of  distribution  free 
procedures  for  the  analysis  of  complete  blocks.  Presented  at  the  National 
Meeting  of  the  American  Institute  of  Decision  Sciences,  Las  Vegas,  November. 

Pitman,  E.  J.  G.  (1938).  Significance  tests  which  may  be  applied  to  samples 
from  any  populations.  III.  The  analysis  of  variance  test.  Biometrlka,  29, 
pp.  322-335. 

Welch,  B.  L.  (1937).  On  the  z-test  in  randomized  blocks  and  latln  squares. 
Blometrika,  29,  pp.  21-52. 


286 


ALTERNATIVE  QUANTILE  ESTIMATION 


W.  D,  Kaigh 

The  University  of  Texas  at  El  Paso 


Abstract .  An  alternative  to  the  conventional  sample  quantile  Is  pro¬ 
posed  as  a  nonparametrlc  estimator  of  a  continuous  population  quantile. 

The  alternative  estimator  is  a  "generalized  sample  quantile"  obtained  by 
averaging  an  appropriate  subaample  quantile  over  all  subsamples  of  a  fixed 
size.  Since  the  resulting  statistic  is  a  U-statistic  with  representation 
also  as  a  linear  combination  of  order  statistics,  known  results  are  employed 
then  to  establish  asymptotic  normality.  The  alternative  estimator  is  shown 
to  be  asymptotically  efficient  in  the  class  of  nonparametrlc  models  specified 
by  Pfanzagl  (1975).  Analytic  results  and  Monte  Carlo  studies  with  moderate 
sample  sizes  indicate  that  the  proposed  estimator  usually  produces  mean 
square  error  of  estimation  less  than  that  of  the  conventional  sample  quantile 
and  also  jackknifes  to  provide  approximate  confidence  intervals. 

1.  Introduction.  Suppose  that  F  is  an  absolutely  continuous  c.d.f.  with 
corresponding  p.d.f.  f.  For  0  <  u  <  1  let  G(u)  **  inf  {x:  F(x)  <=  u)  be  an 
inverse  of  F  and  denote  the  derivative  G'(u)  =  l/f[G(u)]  when  it  exists. 


For  0  <  p  <  1  define  5  to  be  the  pth.  quanrile  of  F  which  satisfies  £  ■  G(p) 

P  P 

We  assume  throughout  that  f(£p)  51  0. 

Suppose  that  X.,  ....  X  are  i.i.d.  r.v.'s  with  c.d.f.  F  and  denote 
1  n 

the  corresponding  order  statistics  by  ....  X  .  Assuming  no  further 

information  regarding  F,  the  conventional  estimator  of  the  population 
quantile  £  is  the  pth.  sample  auantile  X, ,  ...  ,  ,  where  fx]  denotes  the 

integral  part  of  x.  The  asymptotic  distribution  of  the  sample  quantile  is 
given  by  the  following  well  known  result  (e.g.  Wilks  (1962),  page  273): 


<ll> . “-(‘iwiph.  -  s1” B(0-  <’ptm  • . - 

where 

ohf)  -  p(l-p)/f2(E  )  -  pCl-pHG*<p>]2  . 

P  P 

In  a  nonparamatr ic  context  assuming  a  positive  differentiable  p.d.f., 
Pfanzagl  (1975)  has  shown  that  the  sample  quantile  is  efficient  among  the  class 
of  all  translation-equlvarlant  and  asymptotically  median  unbiased  estimators. 
However,  Reiss  (1980)  has  demonstrated  that  quasiquantiles  may  perform  con¬ 
siderably  better  than  sample  quantiles  when  comparisons  are  based  on  the  notion 
of  deficiency  as  introduced  by  Hodges  and  Lehmann  (1970). 

Kaigh  and  Lachentruch(1981)  propose  and  study  another  alternative  to 
the  sample  quantile  in  an  attempt  to  improve  the  precision  of  the  estimation 
of  population  quantiles.  The  alternative  estimator  is  a  U-statlstlc  with 
representation  as  a  linear  combination  of  order  statistics  and  may  be  viewed 
as  a  "generalized  sample  quantile"  obtained  by  averaging  a  sample  quantile 
estimate  over  aubsamples  of  the  complete  sample.  Although  subaampling  schemes 
are  common  and,  in  fact,  our  generalized  sample  median  was  obtained  first  by 
Yanagawa  (1969)  as  a  robust  estimator  of  location  for  symmetric  distributions, 
the  procedure  provides  a  natural  local  "smoothing"  of  the  entire  sample 
quantile  function.  In  a  related  study  Harrell  and  Davis  (1981)  consider  a 
similar  quantile  estimator  obtained  through  application  of  the  bootstrap. 

In  Section  2  we  provide  the  Introduction  of  the  alternative  estimator 
and  a  discussion  of  its  elementary  properties;  in  Section  3  we  determine 
the  asymptotic  distribution  of  the  alternative  estimator  as  an  application  of 
known  results  concerning  linear  combinations  of  order  statistics  and  U-statistics 
in  Section  4  we  employ  both  analytic  methods  and  Monte  Carlo  results  to  compare 
the  alternative  estimator  with  the  conventional  sample  quantile  estimator; 
finally,  in  Section  5  we  Jackknife  the  alternative  estimator  to  obtain  in¬ 
terval  estimates  for  population  quantiles. 
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2.  The  Alternative  Estimator  K, 


.  For  a  fixed  Integer  k  satisfying 

tin 


1  £  k  n,  consider  the  selection  of  a  simple  random  sample  (without  replace* 


ment)  from  the  complete  sample  X^,  . ..,  XQ  and  denote  the  ordered  observations 
in  the  subsample  by  Y.  .  .  ,  . Y  .  .  An  elementary  combinatorial  argument 

x  •  KiD  k : 

shows  that  for  each  integer  r  satisfying  1  ^  r  <_  k 


P‘«r;k;p  •  Xj,„>  ■  «><G>'4>.  1  i  J  i  r  +  n  -  k  . 


For  0  <  p  <  1  a  sample  quantile  estimator  of  £p  based  on  the  observations  in  a 
single  subsample  would  be  We  define  alternative  quantile 

estimator  to  ^e  aubaample  quantile  averaged  over  all  (£)  sub- 

samples  of  size  k  so  that 


(2.1) 


r+n-k 


l<k+l)pj:k;n  *  <  kli>  /  ;  •  *  ’  «k«)p) 


The  estimator  of  (2.1)  is  obviously  tranalation-equivariant  (i.e., 

Kl(k+l)p] :k;n^Xl+c . Xn+c)  ”  X[(k+l)p)  :k;n(Xl . V  +  c)  *1"1 

Kt(k+l)p):k;rv  *  E<Vl(k.l)p]:k;n|Xl . Xn>  "ith  *»P«ct.tlon  Pr!ll(P).  the  «een 

of  the  r  ■  l(k+l)p]  th.  order  statistic  in  a  random  sample  of  size  k  from  F. 

From  the  development  through  averaging  the  symmetric  kernel  f*(x^,  x^) 

over  all  subsamples,  it  follows  that  .^.n  a  U-etatistic  with 

representation  also  as  the  linear  combination  of  order  statistics  given  by  (2.1). 
In  a  specialized  application  to  reliability  theory  Takahasl  (1970)  also  con¬ 
sidered  the  U-statlstlc  above  as  an  estimator  of  its  mean  ur,^(F).  The  weights 
which  appear  in  the  summation  of  (2.1)  correspond  to  the  probability  distribu¬ 
tion  of  a  negative  hypergeometric  random  variable  representing  the  number  of  lndl 
vldual  selections  (without  replacement)  required  to  obtain  a  total  of  r  "special 
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items"  from  a  dichotomous  population  consisting  of  exactly  k  "special  items" 
and  n-k  "ordinary  items".  The  mean  and  mode  of  the  negative  hypergeometric 
distribution  appearing  In  (2,1)  are  r(n+l)/(k+l)  and  [ (r-l)n/(k-l)]  +  lf  res¬ 
pectively,  which  indicate  a  weight  function  centered  appropriately  about  (np). 

A  sample  quantile  is  not  in  general  an  unbiased  estimator  of  the  cor¬ 
responding  population  quantile,  although  (1*1)  shows  that  any  bias  becomes 
negligible  with  increasing  sample  size.  Appeal  to  a  monotonicity  principle 
would  suggest  that  the  subsampling  scheme  provides  an  estimator  *k'n 

of  with  bias  magnitude  exceeding  that  of  the  conventional  estimator 
X((n+l)p]*n*  However,  would  seem  plausible  also  that  the  averaging  pro¬ 
cedure  might  result  in  a  reduction  of  sampling  variability  adequate  to  de¬ 
crease  the  overall  mean  square  error  of  estimation. 

Subject  to  the  obvious  constraint  1  <.  k  <_  n,  the  assumed  subsample  size 
is  arbitrary  and  the  choice  k  -  n  in  (2.1)  gives  Kf (n+1)pJ .n;n  "  X[(n+l)p]:n 
so  the  statistics  defined  by  (2.1)  form  a  collection  of  "generalized  quantile 
estimators"  which  Includes  the  usual  sample  quantile.  As  an  illustration 
consider  a  complete  sample  size  n  *  99  and  the  estimation  of  Permis¬ 

sible  subsample  sizes  are  then  k  ■  19,  39,  59,  79,  99  with  corresponding 
t(k*-l)p]  ■  1,  2,  3,  4,  5,  where  for  convenience  we  have  chosen  to  avoid  the 
use  of  fractional  order  statistics  (see  Stigler  (1977))  and  adopted  a  con¬ 
vention  that  a  quantile  is  estimable  from  a  sample  of  size  k  only  if 
(k+l)p  is  an  Integer.  The  estimation  problem  becomes  that  of  choosing  the 
subsample  size  appropriate  to  the  minimization  of  tk*n  **  ^p^2* 

AlthoOgh  the  theory  of  U-statlstlcs  as  developed  by  Hoeffding  (1948)  would 
suggest  a  choice  of  the  minimal  permissible  subsample  size  to  provide  a  ker- 
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nel  estimator  of  minimum  variance,  the  substitution  k«l  in  (2.1)  provides  the 
sample  mean  as  an  estimator  of  a  possibly  asymmetric  population  median.  Ob¬ 
viously  the  minimization  of  mean  square  error  of  estimation  requires  consi¬ 
deration  of  bias  magnitude  as  well  as  sampling  variance. 

Finally,  since  the  exact  distributional  properties  of  U-statlstlcs  and 
of  linear  combinations  of  order  statistics  typically  are  Intractable,  our 
subsequent  analyses  are  concerned  with  asymptotic  development  and  simulation. 
Although  the  robustness  properties  of  an  averaging  process  are  suspect,  in 
practice  the  estimator  K[ (k+l)p] ‘k'n  is  comrute<*  from  a  trimmed  sample  and  is 
quite  robust  provided  that  care  is  exercised  to  avoid  the  sample  extremes. 


3.  Asymptotic  Distribution  of  .^;n*  <^,ur  immediate  object iv?  is  to 

obtain  asymptotic  distribution  results  for  the  statistic  K,,.  ...  ...  to 

[ (k+l)p):k;n 

facilitate  comparison  of  the  estimators  introduced  in  Section  2.  Theorem  3.1 
requires  a  fixed  subsample  size  k  whereas  theorem  3.2  considers  a  subsample 
size  increasing  in  proportion  with  the  total  sample  size  n. 

For  a  fixed  subsample  size  we  first  formulate  and  then  apply  the  results 
of  Hoeffding  (1948).  Suppose  X^,  ...,  XR  are  i.i.d,  r.v.'s  and  let  f*(X^,  ...,  Xm) 
be  a  real-valued  symmetric  statistic  with  mean  n  and  second  moment 
E(f*(X^,  . ..,  X^)]2  <  oo.  The  corresponding  U-statistlc  for  n  is  then 

U»«l . V  ■  O'1  l  £*<X0, . X.  > 

Cl  in 

n 

where  C  indicates  that  the  summation  is  over  all  combinations  (a,,  ...,  a  } 
n  1  m 

of  ra  integers  selected  from  {1,  ...,  n).  Then  Un  has  expectation  n  for  all 
n  >_  m  and 
(3.1) 


n^(Un-n)  -  N(0,  m2?1) 
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where 


Cj.  -  Var[E(f*(X1,  .... 


Cj_  “  Var[E(f*(X1,  ....  X^JX^). 

2 

Moreover,  n  Var  UQ  Is  a  decreasing  function  of  n  with  limit  tn  . 

Let  r  ■  [(k+l)p]  and  recall  now  from  Section  2  that  *8  the 

U-statistic  for  wr.fc(P)  corresponding  to  the  kernel  f*(x^,  ...»  x^)  »  xr.^* 

r-1  k-r 

Denoting  the  beta  p.d.f.  mr,^(x)  ■  tl/B<r,k-r+l)]x  (1-x)  ,  0  <  x  <  1,  we  write 


the  expectation  as 
(3.2) 


Rt.k(F)  *  l  mr;k^  du’ 


Employing  the  formula  for  the  variance  of  the  projection  of  an  order  statistic  glvei 
in  lemma  2  of  Stlgler  (1969)  yields  a  convenient  representation  of  the  asymptotic 
variance  k2  Var  E(Xr-jJX^)  as 


(3.3) 


0*  (P)  ■  /  /  (u^v-uv)G'(u)G,(v)m  (u)m  (v)dudv. 
r  i  k  r*  r  •  k  r .  k 


Application  of  (3.1)  provides 
THEOREM  3.1.  For  0  <  p  <  1  and  k  fixed. 


n  (iW)p]:k;n-  yr:k(P))  *  N<0’  °r2:k(F))  89  n  -  where  r  "  t(k+1)p1' 

Although  not  presented  here,  a  multivariate  extension  of  theorem  3.1  follows  easily 

from  further  results  in  Hoeffding  (1948).  The  univariate  development  given  here 

appears  also  in  Takahasi  (1970)  and  it  should  be  noted  that  the  conclusion  requires 

only  the  existence  of  the  variance  of  the  rth.  order  statistic  in  a  random  sample 

of  size  k  from  F.  Also,  n  Var  K[(k+l)p] *k*n  decreases  with  limit  ^ (F>  of  (3.3). 

For  the  case  of  a  subsample  size  Increasing  in  proportion  with  the  total 

sample  size  we  formulate  the  results  of  Bickel  (1967).  Let  [m,  },  1  <  J  <  n, 

J  *n 

n  >  1  be  a  double  sequence  of  constants  such  that  m,  "0  for  j  £  5n,  j  >  (l-6)n 

J  *  n  — 

n 

for  some  6  >  0  and  consider  the  statistic  T  ■  E.  m.  X.  .  If  there  exists 

«  j*l  j.n  ):n 

M(u)  of  bounded  variation  on  I  ■  [6,  1  -  5]  such  that  Mn  (u)  “j5nu  ®j  n  •*  M(u) 
Definition:  uav  -  min(u,v). 
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on  a  dense  set  of  I  and  that  sup  (Mjj)  <  •  (V^  denotes  total  variation) ,  then 

(3.4)  n1^  -  E(Tn))  +  N(0,a2(M,F)) 

where 

o2(M,F)  ■  /*  /^u^sv-uv)^  (u)G' (v)dM(u)dM(v) . 

0  o 

Next  we  apply  the  above  version  of  theorem  4.1  of  Bickel  to  the  statistic 

K[ (k+l)p) :k;n  when  k/n  -  X,  0  <  X  <  1,  as  n  ^ 

The  negative  hyper geometric  probabilities  given  in  (2.1)  specify  the  pro¬ 
bability  distribution  of  a  random  variable  U  .  (n)  corresponding  to  the  rth. 
order  statistic  in  a  simple  random  sample  (without  replacement) from  the  finite 
population  (j/(n+l):l  £  j  £n}.  The  mean  and  variance  of  U  .  (n)  are  respec¬ 
tively  r/(k+l)  »  p  and  r(k-r+l)  (n-k)/  (k+1)  ^(k+2)  (n+1)  0  as  k,n  ».  It  fol¬ 
lows  by  ‘Chebyshev's  inequality  that  converges  in  distribution  to  the 

unit  mass  assigned  to  the  point  p.  Application  of  (3.4)  gives 
THEOREM  3.2.  For  0  <  p  <  1  and  k/n  +  0  <  X  <  1,  88  n  +  » 

^  <K((ktl,p)!k;»-  V  °"(0’  «p(F» 

where 

°p(F)  -  p(l-p)/f2($p)  -  pCl-pHG^p)]2. 

Although  not  developed  here,  it  follows  from  theorem  4.3  of  Bickel  (1967)  that 
the  conclusion  above  holds  whenever  k  tends  to  infinity  with  n,  provided  that 
F  has  finite  second  moment  and  that  is  replaced  with  u  j  (k-tl)pj  *k‘  ®*nce  UB<^er  che 
conditions  of  theorem  3.2  it  even  can  be  shown  that  n* (K^ (k+l)p] *k* n”  *((n+l)p]*n^  ® 
with  probability  one,  the  rationale  for  inclusion  of  Bickel' s  results  and  theorem  3.2 
is  the  demonstration  that,  in  a  certain  sense,  the  generalized  sample 
quantile  is  efficient  in  the  nonparametric  models  discussed  in  Section  1.  In 
addition,  the  development  of  theorem  3.2  illustrates  the  applicability  of  re- 
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suits  concerning  linear  combinations  of  order  statistic  to  the  alternative 
estimator.  Under  more  restrictive  hypotheses,  Bickel's  theorem  4.3,  In  fact, 
will  provide  our  theorem  3.1  since  n  “  t»r.k  (J/*0  and  Ur,k(n) 

has  asymptotic  beta  distribution  for  fixed  k  as  n  -*■  “. 

4.  Comparisons  of  the  Quantile  Estimators  Kr,.  ,n  ,  .  .  Although  preferences 

_ — _ i _ : _ u,K»Jjpj  :n,n 

among  competing  estimators  often  are  established  through  extensive  Monte  Carlo 
studies,  our  treatment  here  is  more  in  the  spirit  of  the  "small  sample  asympto¬ 
tics"  of  Stigler  (1977).  However,  some  simulation  results  are  included  primarily 
to  evaluate  the  adequacy  of  certain  analytic  approximations.  It  is  of  interest 
that  the  limited  small  sample  numerical  comparisons  in  Yanagawa  (1969),  (1970) 
suggest  merit  of  the  generalized  sample  median  in  the  specialized  application  as 
a  location  estimator  for  synmetrlc  distributions. 

As  an  initial  step  in  the  comparisons  we  consider  the  asymptotic  variances 

of  the  estimators  X,,  ...  .  and  Kr/.  ...  ,  .  .  The  equality  of  the  variances 

l(n+l)p]:n  I(k+l)p):k;n  ^  ’ 

given  in  (1.1)  and  theorem  3.2  indicate  asymptotic  equivalence  so  we  consider 
o^.k(F)  of  (3.3).  First  we  investigate  some  specific  distributions  which  per¬ 
mit  explicit  calculation  and  provide  some  insight  regarding  the  behavior  of  the 
alternative  estimator.  In  addition,  the  examples  supply  motivation  for  a  sub¬ 
sequent  approximation  and  its  limitations.  Although  the  result  probably  is 

available  elsewhere,  we  include  details  of  the  calculation  of  a  2 .(F)  for  the 

r  •  k 

uniform  distribution  since  the  derivation  is  probabilistic  and  possibly  new.  In 
the  other  example  we  simply  list  o*  (F)  of  (1.1)  and  a*, ^(F)  of  (3.3),  the  de¬ 
rivations  being  quite  similar.  We  assume  throughout  that  (k+l)p  and  (n+l)p  are 
Integers. 

EXAMPLE  4.1.  Standard  uniform  distribution.  Let  F(x)  ■  x,  0  <  x  <  1,  G(u)  *  u, 

0  <  u  <  1,  £  ■  p.  From  (1.1)  we  obtain 

°p2(F)  “  P(l-p)  ■  r(k-r+l)/(k+l)2. 
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(4.1) 


From  (3.3)  we  have 


(4.2)  a*jk(F)  “  2  l  C  u^1~'v^ar:ilMmr:^v^dudv’ 

An  easy  manipulation  of  the  integrand  provides 

cr:k(F)  "  t*(k-r+l)/(k+l)2]. 

2  f1  f'  (1/B(r+1,  k-r+l)B(r,k-r+?)]  ur(l-u)k“rvr"1(l-v)k'r+1dudv. 
o  © 

The  integral  above  admits  an  interpretation  as  the  probability  that  a  random 
variable  V  ^  d^8tr^uted  33  beta  with  parameters  r+1,  k-r+1  is  less  than 

a  random  variable  W  distributed  independently  as  beta  with  parameters 

r  •  |C »  i. 

r ,k-r+2.  Consider  two  independent  random  samples,  each  consisting  of  k+1  ob¬ 
servations  from  the  continuous  uniform  distribution  on  (0,1).  Then  it  follows 
that  be  cotnPuted  as  the  probability  that  the  (r+l)st. 

order  statistic  in  the  first  sample  is  less  than  the  rth.  order  statistic  in 
the  second  independent  sample.  A  combining  of  the  two  independent  samples  and 
an  elementary  combinatorial  argument  regarding  the  sample  origin  of  the  smallest 


2r  observations  shows  that 


p"Vl:kH  < 


,k+l..k+l  v,.2k+2. 

1  x  M2r-xm  2r  '* 


Symmetry  of  the  hypergeometric  distribution  indicated  above  provides 
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Here  both  estimators  are  unbiased  for  £p»  so  it  follows  from  (4.4)  that 
K[ (k+l)p] -k’n  haa  <aey«Ptotic>  oean  a9uar®  error  less  than  that  of 
X((n+l)p]*n  ^or  an^  al^owa'>^e  subsample  size  if  the  population  c.d.f.  is  that 
of  the  uniform  distribution  on  (0,1). 


**x  * 

EXAMPLE  4.2.  Standard  logistic  distribution.  Let  F(x)  ■  (1+e  )  , 

-  “  <  x  <  *  ,  C(u)  ■  log[u/(l-u)] ,  0  <  u  <  1,  5  ■  lbg[p/(l-p)] . 

Then 

oJ(F)  -  l/p(l-p)  -  (k+l)2/r(k-r+l). 

°r:k(F)  "  tk2/ (r-1)  (k-r)  ]  [1  -  (£J)  »  1  <  r  <  k. 

EXAMPLE  4.3.  Standard  exponential  distribution.  Let  F(x)  ■  1-e  X, 

0  <  X  <  G(u)  -  -log(l-u) ,  0  <  u  <  1,  5  »  -log(l-p). 

Then 

Op(F)  -  p/(l-p)  -  r/ (k-r+1) 

°J:k<F)  ■  I'/(k-r)Hl  -  1  <  r  <  k. 

Vc 

EXAMPLE  4.4.  Standard  power  function  distribution  Os) •  Let  F(x)  ■  x  , 

0  <  x  <  1,  G(u)  ■  u2,  0- <  u  <  1,5^  “  P^* 

Then 

o2(F)  ■  4p3(l-p)  ■  4r3 (k-r+1) /(k+1)^ 

P 

o2;k(F)  -  [4r2(r+l)(k-r+l)/(k+l)2(k+2)2J 

r  *  ,k+2wk+2,/,2k+4. ,  .  „  . 

*  [1  -  WW/W1’  1irik- 


Although  many  standard  distributions  such  as  the  normal  do  not  possess 
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inverse  cuoulatives  which  permit  calculation  of  (3.3) ,  the  use  of  Tukey's 
lambda  distributions  (see  Joiner  and  Rosenblatt  (1971)  and  Ramberg  and 
Schmeiser  (1974  can  provide  suitable  approximations.  However ,  as  an  alter¬ 
native  approach  we  observe  instead  that  the  examples  presented  above  suggest 

an  approximation  of  o2  .  (F)  adequate  at  least  for  qualitative  purposes.  As 

r«  K 

our  primary  objective  is  to  ascertain  the  behavior  of  the  alternative  esti¬ 
mator  over  a  large  class  of  "well-behaved"  distributions,  we  implicitly  as¬ 
sume  throughout  the  necessary  smoothness  conditions  on  the  inverse  c.d.f.  G. 

Computation  of  the  variance  ratios  .  ,i  the  preceding  examples  suggests 
that  for  r,r-k,  and  k  of  moderate  si2e 

(4.5)  °p(F)/o?:k(F)  "  (1  “  (t^1Hkr1>/(2^2)i’1  >  1. 

NOTE.  Examination  of  o2  .  (F)  in  examples  4.2  -  4.4  indicates  the  importance 
of  the  qualifying  statement  "of  moderate  size". 


Now  the  respective  mean  and  variance  of  the  beta  density  are  r/(k+l)  ■  p 

2 

and  r(k-r+l) /(k+1)  (k+2)  ■  p(l-p) /(k+2) .  Assuming  that  the  continuous  p.d.f. 
f  is  relatively  constant  near  the  approximation  G' (x)  =  G’(p)  in  (3.3) 
provides 


a2.k(F)  =  (G* (p)  3 2  2 /l/v  u(l-v)mr;k(u)mr>k(v)  dudv. 


The  integral  above  is  precisely  that  of  (4.2)  computed  in  the  uniform  case  of 
example  4.1  so  we  obtain 

2  —  _  /  i  _\r/^t/_\i2f,  /k+1  v  ,k+l  v  ,  /  2k+2. 


;2.k(F)  3  p(i-p) (g1  (p) ]  [i  -  rpr;*)/c , )i 


(4.6) 


7,.,,,  /k+1.  /k+1.  ,,2k+2.  ,  r  /,  , , ,  i 

OpClMl  -  (  r  )(  r  )/(  2r  )],  r  «*  [  (k+l)p] . 


The  preceding  presents  some  justification  of  (4.5)  and  the  resultant  implication 
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that  the  ratio  of  the  asymptotic  variances  o*(F)/c2  .  (F)  is  "approximately 

P  tiK 

independent"  of  P  for  r,  r-k,  and  k  of  moderate  size.  Another  consequence  of 
(4.5)  is  the  suggestion  that  the  ratio  of  the  asymptotic  vari~ 
ances  decreases  to  1  with  increasing  subsaopie  size. 

To  investigate  the  adequacy  of  the  approximations  of  theorem  3.1  and 
(4.5)  we  performed  10,000  simulations  of  median  estimation  for  seven  symmetric 
distributions  based  on  a  complete  sample  Blze  n  -  99  and  subeample  sizes 
k  *  9,  19,  29,  39,  79.  The  results  appear  in  Table  4.1  and  are  qualitatively  aa 
predicted  and  quantitatively  in  quite  reasonable  agreement  with  (4.5) (and  the 
results  of  examples  4.1  and  4.2). 

Recall  now  that  the  formulation  of  theorem  3.1  provides  an  estimator  of 
which  is  asymptotically  biased  in  most  cases.  The  simulations  of  Table  4.1 
were  selected  to  avoid  confounding  bias  considerations  (both  estimators  are 
unbiased  for  the  medians  of  these  symmetric  distributions)  so  that  the  theo¬ 
retical  variance  ratios  equal  the  theoretical  mean  square  error  ratios.  In 
the  more  general  case,  a  decrease  in  variance  by  the  subsampling  scheme  can 
be  insufficient  to  achieve  the  desired  reduction  in  mean  square  error  of  es¬ 
timation.  It  is  clear  then  that  asymptotic  bias  magnitude  should  be  considered 
in  the  evaluation  of  the  alternative  estimator. 

For  this  objective,  ignoring  all  but  the  first  three  terms  of  a  Taylor's 
expansion  of  G  in  (3.2)  gives 

Pr;k(«  ‘  Sp  2  p(l-p)G"(p)/2(k+2) 

which  in  conjunction  with  (4.6)  provides  the  approximate  mean  square 
error 
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MSEr:k(F>  “  °r:k(F)/n  +  lur:k(F)  '  V* 

-  p(l-p)[G'(p)]2{l  -  (^)(*l>(2£2)]/n 

+  [p(l-p)G"(p)]2/4(k+2)2,  r  -  t(k+l)p] . 

Similarly.  x[(n+i)p].n  1488  approximate  mean  square  error 

HSEp(F)  -  »p2(F>/»*  l»[(M1)p,!0<F>  -  tp!2 

•  p(l-p)lG*(p)]2/n  +  [p(l-p)G' ’ (p)  ]2/4(n+2)2. 

It  follows  that  for  moderate  r,r-k,  and  k  and  large  n 

(4.7)  MSEp(F)/MSEr;k(F)  =  {1  4-  P(l-p) (G» ' (p) /G‘ (p) ]2/4n} 

•  {[1  -  <k+1)(k*1)/(22*2)]  +  n  p(l-p)[G,’(p)/G,(p)]2/4k2}"1  ,  r  -  [(k+l)p] 

If  k  is  not  too  small,  both  bias  terms  in  (4.7)  are  negligible  and  the  remarks 
Immediately  following  (4.6)  apply  to  mean  square  error  as  well. 

Finally  in  Table  4.2  we  present  further  simulation  results  for  both  sym¬ 
metric  and  asymmetric  distributions  for  n  -  99,  k  ■  39,  k  ■  79,  and  p  varying 
from  0.05  to  0.95.  The  results  are  in  quite  reasonable  agreement  with  (4.5) 

(the  bias  terms  in  (4.7)  are  indeed  negligible)  for  0.2  <  p  <  0.8.  The  alter¬ 
native  estimator  performed  better  than  the  conventional  estimator  for  all  de¬ 
ciles  of  all  distributions  except  the  heavy-tailed  double  exponential  and  Cauchy 
for  k  -  39,  79.  Problems  were  encountered  for  extreme  quantiles  of  the  power 
function,  exponential,  logistic,  and  normal  distributions,  also.  The  corres¬ 
ponding  values  of  r  were  2  and  38  (not  moderate)  indicating  difficulty  with 
the  accuracy  of  (4.5)  and/or  bias  magnitude.  However,  it  should  be  noted  that 
the  intent  of  Table  4.2  is  to  suggest  the  existence  of  a  single  subsample  size 
k  providing  simultaneously  better  estimates  for  a  spectrum  of  quantiles  over  a 
class  of  different  distributions,  and  a  larger  subsample  size  would  eliminate 
the  aberrant  cases.  In  practice  different  subsample  sizes  probably  should  be 
employed  for  estimation  of  different  quantiles. 


299 


TABLE  4.2  Ratio  of  Mean  Square  Error  +  ^  j,n  to  Mean  Square  Error  K^k  +  Based  on 

10,000  Simulations. 


5.  Quantile  Interval  Estimation.  The  asymptotic  results  for  the  point  estimators 
X,,  ...  ,  and  Kf/.  ...  ,  .  may  be  employed  to  obtain  large-sample  confi- 
dence  intervals  for  population  quantiles  provided  that  sample  estimates  may 
be  obtained  for  the  asymptotic  variances  of  (1.1)  and  (3.3).  Here  we  consider 
application  of  the  jackknife  procedure  to  obtain  sample  variance  estimates  of 
the  generalized  sample  quantile,  although  an  alternative  method  using  the 
sample  quantile  function  and  tables  of  incomplete  beta  functions  is  described 
by  Mar it z  and  Jarrett  (1978). 

First  we  develop  briefly  the  jackkknlfe  estimator  of  an  unknown  population 

*o 

parameter  6  based  on  a  random  sample  X. ,  . . . ,  X  .  Let  9  be  the  estimate  of 

Inn 

6  based  on  all  n  observations  and  let  6*  ,,  1*1,  ...,  n,  be  the  estimate 

n-1 

obtained  by  deletion  of  the  ith.  observation.  The  pseudo-values  are  defined 
by 

®i  “  n  ®n  "  <n“1>  ®n-l’  1-1 »  n 

and  the  jackknife  estimator  of  6  is  then 

-  n 

e  "i£i  Vn- 

A  sample  estimate  of  the  variance  of  the  jackknife  estimator  is  given  by 
S~  ■  S^/n  where 

s2  “  i5l  (®i  *  §)2/ (n-1) . 

2 

Under  certain  conditions  (e.g.  Miller  (1964))  S*  is  consistent  and  the 
standardized  statistic  (6  -  0)/S;  is  asymptotically  standard  normal.  The 

O 

jackknife  estimator  may  be  employed  then  as  a  pivotal  statistic  for  robust 
interval  estimation  of  0. 

Although  the  sample  quantile  represents  a  classic  failure  of  the  jackknife 
procedure  (see  Efron  (1979)),  we  show  that  the  generalized  sample  quantile 
estimator  "jackknifes  well"  and  the  asympototic  behavior  of  the  statistic 
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K  .  under  the  jackknife  follows  easily  by  application  of  results  of  Arvesen* 

f 

(1969)  concerning  U-statistics. 

Deletion  of  the  ith  observation  provides  pseudo-value  nK  .  -  (n-1)  . 

r • k  »n  r»K 

where  the  weights  required  for  the  computation  of  K1  .  .  are 

(J’J) (n~J~l) /  (n^1).  J-r . rfn-k-1.  Since  Kr;k;n  is  a  U-statistic  the 

2 

pseudo-values  provide  average  K  .  and  sample  variance  S  .  given  by 

t i Kjn  r:*,n 

s2  .  -  (n-1)  E  (K1  .  ,  -  K  .  )2. 

f-lc-n  1~\  irslCsti 

Application  of  Arvesen's  theorem  6  in  conjunction  with  our  theorem  3.1  provides 

THEOREM  5.1.  For  r  and  k  fixed,  asn  +  « 

2  ^2 

i)  r :k;n  °r:k  (F) 


115  n  <W,'r:k<F,>/8r,k,«*"(0»l)* 

We  remark  that  the  results  of  Parr  and  Schucany  (1981)  concerning  jackknifed 
linear  combinations  of  order  statistics  will  produce  theorem  5.1  i  nder  more 
restrictive  conditions  on  the  c.d.f.  F. 

The  preceding  yields  an  approximate  1-a  confidence  Interval  for  the  popu¬ 
lation  quantile  given  by 


<5’l>  Kr:kin  —  ^Va/Z)  Stlk;11/n\  r  -  t(k+l)pl 

where  4>  is  the  standard  normal  c.d.f. 

To  investigate  the  validity  of  (5.1)  for  small  and  moderate  sample  sizes, 
additional  simulations  were  performed  for  the  uniform  and  exponential 
distributions.  Since  the  number  of  distinct  pseudo-values  obtained  is  n-k+1, 
the  standard  normal  percentage  points  in  (5.1)  were  replaced  by  those  of  the 
t  distribution  with  n-k  degrees  of  freedom.  Results  of  median  interval 
estimates  for  the  uniform  distribution  based  on  various  sample  sizes  appear 
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In  Table  5.1  while  results  of  other  quantile  Interval  estimates  for  the  uniform 
and  exponential  distributions  based  on  a  single  moderate  sample  size  appear 
in  Table  5.2.  Taking  into  account  Monte  Carlo  variability,  there  is  only 
small  deviation  of  the  empirical  confidence  levels  from  the  nominal  levels 
even  for  small  sample  sizes.  Although  further  simulation  studies  should  be 
performed,  the  results  of  Section  4  suggest  adequate  validity  of  (5.1) 
for  other  distributions  as  well. 

BIBLIOGRAPHY 

Arvesen,  J.  N.  (1969).  Jackknifing  U-statistics.  Ann.  Mach.  Statist.  40 
2076-2100. 

Bickel,  P.  J.  (1967).  Some  contributions  to  the  theory  of  order  statistics. 
Proc.  Fifth  Berkeley  Symp.  Math.  Statist.  Prob. I  575-591. 

Efron,  B.  (1979).  Bootstrap  methods:  another  look  at  the  jackknife. 

Ann.  Statist.  7  1-26. 

Harrell,  F.  E.  and  Davis,  C.  E.  (1981).  Improved  distribution-free  quantile 
estimation.  In  preparation. 

Hodges,  J.  L.  and  Lehmann,  E.  L.  (1970).  Deficiency.  Ann.  Math.  Statist.  41 
783-801. 

Hoeffdlng,  W.  (1948).  A  class  of  statistics  with  asymptotically  normal 
distribution.  Ann.  Math,  Statist.  19  293-325. 

Joiner,  B.  L.  and  Rosenblatt,  J.  R.  (1971).  Some  properties  of  the  range 
in  samples  from  Tukey's  symmetric  lambda  distributions.  J .  Amar . 

Statist,  Assoc.  66  394-399. 

Kaigh,  W.  D.  and  Lachenbruch,  P.  A.  (1981).  A  generalized  quantile  estimator. 
To  appear. 

Maritz,  J.  S.  and  Jarrett,  R.  G.  (1978).  A  note  on  estimating  the  variance 
of  the  sample  median.  J.  Amer.  Statist.  Assoc.  73  194-196. 

Miller,  R.  G.,  Jr.  (1964).  A  trustworthy  jackknife.  Ann,  Math.  Statist.  35 
1594-1605. 

Parr,  W.  C.  and  Schucany,  W.  R.  (1981).  L-statistics  with  smooth  weight 
functions  jackknife  well.  To  appear. 


304 


Pfanzagl,  J.  (1975).  Invesclgatlng  the  quantile  of  an  unknown  distribution. 
In:  Statistical  Methods  in  Biometry  (dedicated  to  A.  tinder)  (ed. 

W.  J.  Ziegler)  111-126,  BirkhMuser  Verlag,  Basel. 

Ramberg,  J.  S.  and  Schmeiser,  B.  W.  (1974).  An  approximate  method  for 
generating  asymmetric  random  variables.  CACM  17 ,  2  78-82. 

Reiss,  R.  -D.  (1980).  Estimation  of  quantiles  in  certain  non-parametrlc 

models.  Ann,  Hath.  Statist.  8  87-105. 

Stlgler,  S.  M.  (1969).  Linear  functions  of  order  statistics.  Ann.  Math. 
Statist.  40.  770-788. 

Stlgler,  S.  M.  (1977).  Fractional  order  statistics,  with  applications. 

J.  Amer.  Statist.  Assoc.  72  544-550. 

Takahasl,  K.  (1970).  Estimation  of  several  characteristics  of  distributions 
of  order  statistics.  Ann.  Inst.  Statist,  Math.  22  403-412. 

Yanagawa,  T.  (1969).  A  small  sample  robust  competitor  of  Hodges -Lehmann 
estimate.  Bull.  Math.  Statist.  13  1-14. 

Yanagawa,  T.  (1970).  On  some  robust  estimrte  of  a  location  parameter. 

Mem.  Fac.  Gen.  Educ.,  Kumamoto  Univ.,  Ser.  Nat.  Sci.  5  7-16. 


Wilks,  S.  S.  (1962).  Mathematical  Statistics.  Wiley,  New  York. 


305 


TABLE  S.l  Quantile  Interval  Estimation  K 


+  t 


{ (k+l)p] :k;n  —  n-k,a/2  [(k+l)p]:k;n 


/n ' 


Based  on  1,000  Simulations. 


p  •  0.5  Uniform  Distribution 

Observed  Ratio  of 


Sample  Size 

Confidence  Level 

Subsample  Size  0.68*  0.93* 

Observed  Variance 
to  Mean  Jackknife 
Variance  Estimate 

n-99 

k-79 

0.66 

0.92 

0.93 

k-39 

0.67 

0.93 

1.01 

n-49 

k-39 

0.66 

0.91 

0.94 

k-19 

0.68 

0.92 

1.01 

n-39 

k-19 

0.68 

0.94 

0.92 

k-  9 

0.70 

0.95 

0.95 

n-29 

k-19 

0.65 

0.92 

0.94 

k-  9 

0.66 

0.93 

0.98 

n-19 

k-  9 

0.68 

0.93 

0.91 

*nominal  confidence  level 
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TABLE  5.2  Quantile  Interval  Estimation  K 


Based  on  1,000  Simulations. 


Uniform  Distribution 

Observed  Confidence 
Level 


_ E — 

0.68* 

0.95* 

0.10 

0.65 

0.91 

1 

i 

r 

0.20 

0.65 

0.92 

* 

» 

i 

0.30 

0.66 

0.94 

i 

I 

0.40 

0.66 

0.93 

T 

t 

0.50 

0.67 

0.93 

i 

0.60 

0.66 

0.94 

\ 

i 

0.70 

0.67 

0.93 

♦ 

i 

( 

0.80 

0.68 

0.92 

i 

0.90 

0.67 

0.93 

^nominal  confidence  level 
n-99,  k-39 
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:k;n  ±  Vk.a/2  S(  (fcfl)p]  :k;n/n 


Exponential  Distribution 

Observed  Confidence 
Level 

0.68*  0.95* 

0.66  0.92 

0.66  0.93 

0.67  0.94 

0.67  0.93 

0.67  0.93 

0.66  0.95 

0.67  0.94 

0.67  0.93 


0.65 


0.94 
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Abstract.  The  problem  of  nonparametric  probability  density  estimation  is 
considered  for  higher  dimensions.  An  "onion  peel"  algorithm  is  suggested  for 
3 'dimensions.  For  dimensions  of  4  or  more,  a  decomposition  procedure  is 
proposed,  which  first  finds  the  centers  of  mass  using  nearest  neighbor  tech¬ 
niques,  then  estimates  the  density  around  these  centers  using  fixed  mesh  pro¬ 
cedures. 
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Introduction 


There  are  many  reasons  for  the  possible  failure  of  standard  parametric 
statistical  procedures.  Among  these,  the  problem  of  tailiness  beyond  that 
in  the  model  assumed  has  attracted  the  most  interest.  As  one  example,  for 
some  years  now,  rank  tests  have  bean  used  as  an  alternative  to  likelihood 
ratio  tests  [7].  More  recently,  notions  of  robustness  as  delineated  in  the 
Princeton  Robustness  Study  have  moved  to  center  stage  In  statistical  inves¬ 
tigation  (  1].  Both  these  sets  of  techniques  tend  to  assume  synsuetry  and 
unimodality  of  the  underlying  distributions.  Both  are  somewhat  tied  to  one 
dimensional  probability  densities. 

A  second  type  of  pathology,  and  the  one  to  which  we  shall  address  our¬ 
selves  in  this  paper,  is  departures  of  the  underlying  distributions  from 
unimodality  and  symmetry.  In  this  case  protection  against  tailiness  will  be 
of  little  avail.  Procedures  are  required  which  will  be  robust  against  the 
unexpected  "in  the  center." 

Of  such  techniques,  the  oldest  is  the  histogram,  which  existed  in  crude 
form  as  long  ago  as  1662  (4].  The  "shifted  histogram"  of  Rosenblatt  (12] 
gave  greater  efficiency  and  flexibility  than  those  of  the  histogram.  The 
still  more  general  kernel  estimates  of  Parzen  fn]  have  found  wide  appllcabil 
ity 

Another  approach  hts  been  that  of  series  estimates  [  6,  15).  These 
have  a  loyal  group  of  users  but  do  not  presently  enjoy  the  popularity  of 
kernel  estimates. 

A  suggestion  of  Good  and  Gaskins  [  31  to  pose  Bayesian  estimation  in  a 
function  space  setting  for  density  estimation  (with  a  prior  measure  on  the 
space  of  densities)  was  successfully  pursued  by  de  Montrlcher  fiol.  However, 
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the  practical  difficulties  of  algorithmic  imp lamentation  have  given 
preference  to  the  related  concept  (alto  suggested  by  Good  aud  Gaskins)  of 
maximum  penalised  likelihood  density  estimation  [  14,  15  ].  This  algo¬ 
rithm  has  been  Included  as  a  standard  routine  in  the  widely  disseminated 
IMSL  package  [  5  ] . 

The  three  categories  of  density  estlmatlon--hlstogram  (including  the 
shifted),  series,  and  maximum  penalized  likelihood —  are  by  no  means 
exhaustive  of  the  techniques  robust  against  the  possibility  of  multimodality, 
but  are  the  most  commonly  used.  Each  of  these  can  be  generalized  to  several 
dimensions.  The  technique  used  in  this  paper,  however,  la  based  on  the 
shifted  histogram. 


Discussion 


The  nonparametric  estimation  of  densities  in  higher  dimensions  presents 
the  Investigator  with  difficulties  not  encountered  in  the  well  explored  one 
dimensional  case.  If  we  use  evaluation  on  a  standard  fixed  mesh  grid,  we 
have  the  problem  of  exponentially  exploding  cost  of  computation  with 
increasing  dimension.  Moreover,  with  kernel  (shifted  histogram)  techniques 
we  face  the  empty  space  problem--  namely  the  vast  majority  of  grids  will 
contain  no  data  points.  So  a  great  deal  of  our  computation  will  be 
effectively  wasted.  A  preferred  procedure,  then,  would  be  to  use  a  variable 
grid  which  increases  in  size  in  regions  of  low  density  but  decreases  in 
regions  with  many  data  points. 

This  leads  us  to  the  k-neareet  neighbor  algorithm  [2,8].  To  delineate 
it,  we  let 


(1) 


f  (x) 


k  1 

N  Vm(x,d(x,k)) 


where  d(x,k)  =  Euclidean  distance  to  the  kth  nearest  data  point  from  x 

V  (x,d(x,k))  =  the  volume  of  the  m-dimensional  sphere  centered  at  x 
m 

with  radius  d(x,k) 

N  =  the  sample  size. 

We  note  that  as  k  increases,  the  variability  of  our  estimate  for  f 
decreases,  but  at  the  expense  of  Increased  bias.  Sufficient  conditions  for 
consistency  of  the  estimate  in  (1)  are  (p.84,  [15]) 


(2)  lim  k  =  <*> 

lim  k/N  =  0. 
N-»  -x> 
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For  density  estimation  problems  in  1,  2,  or  3  dimensions,  it  is  an  «asy 
matter  to  choose  the  appropriate  k  interactively. 

As  yet  ,  no  completely  automated  rule  for  the  selection  of  h  is 
available,  although  an  iterative  procedure  developed  for  fixed  kernel  width 
selection  [13]  appears  well  suited  to  this  task  using  the  formula  for  the 
mean  square  error  of  nearest  neighbor  estimates  given  in  [  91  .'  Tor 
dimensional  densities  (1,  2,  or  3),  it  is  not  difficult  to  choose  k  inter¬ 
actively.  We  simply  start  with  k  large--  say  N/2--  and  sequentially 
reduce  it  by  powers  of  2  until  the  graphs  of  the  estimated  density  begin 
to  display  high  frequency  wiggles.  Then  we  return  to  the  preceding  value 
of  k. 

A 

It  is  interesting  to  note  that  it  is  the  graphing  of  f  (or  some 
machine  alternative  to  graphing)  which  is  the  greatest  problem  in  density 
estimation  in  higher  dimensions.  The  use  of  a  data  based  "grid"  does  not 
liberate  us  from  the  curse  of  dimensionality.  As  an  example  of  this  point, 
suppose  we  have  a  sample  of  300  from  a  5  dimensional  density.  An  investi¬ 
gator  who  estimated  the  density  using  fixed  mesh  (20/dim.)  would  be 

A  5 

required  to  evaluate  f  at  3.2  X  10  points.  The  nearest  neighbor 
advocate  might  argue  with  some  validity  that  we  could  make  do  with 

A 

evaluating  f  only  at  the  300  data  points.  The  argument  for  this  attitude 
might  be  that  he  la  Interested  at  points  where  f  is  large--  and  these  are 
roost  likely  to  be  near  the  data  points.  But  what  sense  can  he  make  of 

A 

(f(Xj);  i  =  1 ,2, . . . ,300] ?  He  must  somehow  exploit  the  assumed  continuity  of 
f  to  "get  a  picture"  of  it.  In  one  dimension,  the  eye  itself  would  perform 

A 

this  task  from  a  simple  plotting  of  [f(x^);  i  =  1,2, . . . ,300] .  In  two 
dimensions,  one  would  need  to  use  some  care  in  selecting  the  appropriate 
graphical  technique  (2  dimensional  contour  plots,  3-d  Calcomp  plots,  etc.). 


In  3 -dimension a  (which,  due  to  the  added  dimension  from  f,  is  really  a 
4-d  plotcing  problem),  one  will  have  to  be  somewhat  clever.  And  in  higher 
dimensions,  where  our  essentially  3-d  perceptions  fail  us,  what  to  do  is 
unclear.  There  is,  unfortunately,  a  vast  difference  in  knowing  the  functional 
form  of  f  and  simply  knowing  it  on  a  regular  (let  alone  an  irregular)  mesh. 

A  knowledge  of  f  on  the  continuum  would  return  us  to  the  happy  world  of  para¬ 
metric  probability  densities  (a  low  dimensional  problem).  A  knowledge  of 
f  at  a  discrete  number  of  points  leaves  us  with  a  problem  of  high 
dimensionality.  Of  course,  we  shall  not  even  know  f  at  a  finite  number 
of  points--  only  an  estimate  f.  But  from  a  practical  point  of  view. 
Inferential  difficulties  would  remain--  even  if  we  knew  £  exactly  at  a 
discrete  number  of  points . 

Let  us  consider  the  following  question:  would  we  rather  have  a  random 
sample  of  300  from  our  unknown  5  dimensional  density  f  or  would  we  rather 
know  f  precisely  at  300  points  selected  from  a  uniform  distribution  over 
the  5 -dimensional  hypercube  in  which  we  know  a-priori  the  bulk  of  the  mass 
of  f  is  imbedded?  A  little  thought  reveals  that  the  first  of  the  two 
cases  is  the  more  informative  (though  we  would  surely  pick  the  second  for 
the  1 -dimensional  problem)  on  those  regions  having  the  greatest  density. 

This  again  argues  against  fixed  mesh  width  shifted  histogram  estimation  and 
in  favor  of  nearest  neighbor  techniques  in  higher  dimensions. 

But  it  also  points  us  toward  the  desirability  of  focusing  on  local 
centers  of  high  density.  Let  us  consider  a  three  dimensional  ballistics 
data  set.  As  a  first  step  we  translate  the  data  to  the  sample  mean  and 
rescale  it  so  that  the  marginal  sample  variances  are  equal.  We  now  consider 
the  estimation  of  f  in  the  three  planes  MV  =  0,  $  =  0,  9  =  0.  We  show, 
in  Figure  5,  the  procedure  whereby  this  estimation  is  carried  out.  Some 


Important  but  mathematically  trivial  computational  savings  can  be  made.  For 
example,  suppose  we  have  determined  the  distance  d(Q,pp  of  a  point  P^, 
with  coordinates  from  a  sample  point  Q  with  coordinates 

(x,y,s).  Then  going  to  the  next  point  P^x^  +  6,yl  tn  the  8rid,  we 

have  the  simple  update  formula: 


(3)  d2(Q,P2)  *  d2(Q,P1)  +  62  +  26  (x  -  x^  . 

The  gain  in  computational  efficienty  using  this  simple  update  formula  i9 
of  the  order  of  2  to  3  (note  that  if  we  had  not  used  a  regular  grid,  this 
saving  would  not  have  been  available  ). 

Next,  we  note  that  if  we  use  (1)  for  estimating  £(x^)  for  a 
predetermined  k  *  pN  it  is  not  essential  that  we  use  precisely  this  value 
of  k  in  the  formula,  as  long  as  we  know  what  k  is.  Consequently,  we 
select  randomly  a  subset  of  the  N  data  points  of  size  M  =  2r  (with, 
typically,  r  =  6  or  7).  Then  we  find  the  distance  to  the  2s th  nearest 
neighbor  to  the  grid  point  x^^  where  2S/2r~  p.  Call  this  distance  d. 
Returning  to  the  full  data  set,  count  the  number  of  sample  points  at  least 
as  close  to  x^  as  d  let  the  number  of  such  points  be  called  k'(^k). 
Then  we  use  the  formula 


(3) 


.  kj.  l 

f(xl}  N  V  (x  d) 
m  1 


The  information  loss  caused  by  thiB  latter  "pilot  study"  algorithm  is 
negligible,  while  the  improvement  in  computational  efficiency  is  of  the 
order  [ log2N/log2M] . 

For  each  of  the  three  planes,  we  now  interpolate  to  obtain  the 

A 

(conditional)  iso-f  level  curves.  (Such  curves  lor  MV  =  0  are  given  in 
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A 

Figure  6.)  Me  note  that  such  curves  will  enclose  less  and  less  area  as  £ 

A 

increases.  Connecting  points  on  the  level  curves  for  a  fixed  value  of  f 
gives  us  the  level  surface  (with  MV  coming  out  of  the  page)  in  Figure  7.  We 

A 

next  let  f  increase  to  give  us  the  level  surface  in  Figure  8.  We  continue 

A 

to  increase  f  until  first  one  bump,  then  the  other  disappears.  In  using 
the  "onion  peel"  procedure  for  the  present  problem,  it  was  noted  that  at  the 

A 

f  levels  of  disappearance  of  the  two  bumps,  the  MV  values  were  identical  - 
thus  indicating  only  that  only  one  modal  MV  value  is  appropriate.  Naturally, 
we  might  find  it  desirable  to  make  one  or  two  additional  set9  of  onion  peel 
plots  in  determining  the  coordinates  of  the  modes  (each  corresponding  to  one 
of  the  two  angular  coordinates  being  used  as  the  coordinate  coming  out  oi  the 
page),  since  the  "out  of  the  page"  coordinate  is  not  as  easily  dealt  with  as 
the  two  on  the  page.  In  the  example  at  hand,  we  found  two  modes  with  coor¬ 
dinates:  MV  =  722.51  gm/s,  4  =  -8.15°  and  0  =  24.18°,  45.50°. 

It  is  interesting  to  note  that  for  the  present  example,  although 
we  have  used  the  nearest  neighbor  variant  of  the  shifted  histogram  procedure, 
we  have  used  a  fixed  mesh  grid  to  determine  where  the  density  should  be 
estimated.  One  would  be  justified  in  asking  the  question:  would  we  have 
not  have  done  as  well  to  stay  with  fixed  mesh  estimation  as  well?  The 
answer  is,  "yes,  for  the  present  well  behaved  data  set."  In  general,  if 
we  estimate  f  at  a  point  in  its  support,  we  are  implicitly  assuming  it  to 
be  significantly  greater  than  zero  at  that  point.  And,  if  such  be  the 
case,  the  many  practical  advantages  of  a  fixed  mesh  may  be  decisive. 

In  general,  the  greatest  value  of  a  variable  mesh  should  be  in  pointing 
to  those  regions  of  relatively  high  density.  Once  we  have  determined  the 
rough  boundaries  of  these  regions,  we  might  do  well  to  use  a  tuned  fixed 
mesh  estimation  on  each  of  the  regions.  Thus  we  would  be  using 
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(4) 


l 

f  -  V  * 

£j 
J-l 

•  where  fj(x)  ■  0  for  x  €  . 

So  we  are  advocating  a  kind  of  decomposition  into  regions  of  high  density 
using  k-nearest  neighbor  techniques  followed  by  a  fixed  (for  each  region) 
mesh  estimation  of  the  density  in  each  region. 

Although  we  are  still  working  on  this  two  step  algorithm,  we  can 
already  indicate  some  preLimary  results.  First,  we  start  on  the  fringes 
of  the  data  set.  Then  taking  a  data  point  as  the  first  Iterate  x,,  we  let 

(5)  x  ■  Ave(k  nearest  neighbors  of  x  , ). 

n  n-1 

Experience  shows  that,  at  least  in  dimension  of  3  or  less,  the  algorithm 
in  (5)  will  stop  (or  cycle)  prematurely  -  i.e.,  before  a  bona  fide  local 
maximum  of  f  has  been  essentially  reached.  However,  it  generally  brings  us 
into  the  domain  of  attraction  (for  Newton's  method)  of  a  local  maximum.  So 
then,  a  two  stage  averaging  and  Newton's  method  algorithm  appears  to  work 
well  for  finding  the  local  maxima  of  f. 

Following  the  location  of  centers  of  high  density,  we  can  investigate 
estimation  around  each  locally.  This  might  involve,  for  example,  a  pre¬ 
liminary  investigation  using  nearest  neighbor  techniques  to  determine  the 

A 

contours  of  f  values  1/4,  1/8  and  1/32  of  that  at  the  local  maximum.  In 
many  cases,  it  may  be  possible  to  use  parametric  techniques  for  some  of  the 
local  densities.  In  others,  a  fixed  mesh  technique  -  e.g.,  shifted  histo¬ 
gram  or  maximum  penalized  likelihood-  might  prove  useful. 
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FIGURE  2 
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FIGURE  3 
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TESTABILITY  OP  LINEAR  HYPOTHESES  IN  NORMAL  LINEAR  MODELS 

Gerald  S.  Rogers 
New  Mexico  State  University 
Las  Cruces,  New  Mexico 

ABSTRACT.  Let  a  normal  linear  model  be  represented  by  Y  -  X©  + 
e.  It  is  shown  that  the  usual  F  statistic  derived  from  the 
likelihood  ratio  can  be  used  to  test  the  hypothesis  H©  -  0  in¬ 
dependently  of  any  conditions  of  estimability  provided  that 
P(X')  +  p(H')  -  p(X'  ,  H '  )  is  positive,  (p  denotes  the  rank  of  a 
matrix.)  The  inherent  non-uniqueness  leads  to  the  definition  of 
an  effective  hypothesis:  X©  in  the  range  space  of  X(I  -  H  H); 

(+  denotes  the  Moor e-Penrose  generalized  inverse.)  It  is  shown 
that  this  hypothesis  has  an  estimable  form  TX0  -  0  and  that  the 
procedure  is  equivalent  to  a  previous  definition  of  "effective". 

I  .  THE  LIKELIHOOD_RATIO_TEST.  A  basic  linear  model  is 

representable  by  Y  -  X©  +  e  where  Y  is  n  by  1,  X  is  a  given  n  by 

p  matrix  with  rank  r  <  p  <  n,  ©  is  p  by  1,  e  is  an  n  by  1  normal 

2 

random  variable  with  mean  0  and  covariance  matrix  a  In- 

If  the  vector  a  is  an  arbitrary  element  of  the  p-fold 

cartesian  product  with  real  components,  say  ©  e  <J>,  the  parameter 

?  2 

space  is  A  »  {(©  , a  ):  9  e  <t>  ,  o  >0).  The  hypothesis  that  ©  is 

in  a  subspace  0  of  <t>  is  represented  by  0  -  {(©  ,o2):  0  c  0  , 

a 2  >  0).  Denote  a  likelihood  function  by  lik(A  ,Y)  and  the 

ordinary  Euclidean  norm  by  .  A  generalized  likelihood  ratio 

test  of  the  hypothesis  is  based  on  sup  lik(A  ,Y)/sup  lik(0  ,Y) 

2  2 

which  reduces  to  min  iY  -  X©»  /  min  iY  -  X©» 

9  €0  9et> 

Notation:  for  a  matrix  W,  R(W)  is  the  column  range  space; 

N(W)  is  the  column  null  space;  p(W)  is  the  rank;  W+  is  the 

Moor e-Penrose  generalized  inverse;  tr(W)  is  the  trace  when  W  is 
square,  dim  V  is  the  dimension  of  a  vector  space  V. 

Let  H  be  a  given  h  by  p  matrix  of  rank  h  <  p.  Suppose  that, 
the  hypothesis  is  H©“  0;  that  i  s  S  c  0  »  N(H)  ■->  R(Q)  where  Q  “ 
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1  -  H+H .  Note  that  ♦  is  the  direct  sum  of  N(ll)  and  R(H');  also, 

H+H  and  Q  are  symmetric  idempotent  matrices.  Then, 

SSE  -  min  lY  -  X0»2  -  Y'(I  -  XX+)Y 
9e<X> 

SSH  -  min  lY  -  XB«2  -  min  «Y  -  XQw*2  -  Y’(I  -  XQ(XQ)+)Y 
9e0  we® 

(These  minima  are  done  neatly  in  Albert  (1972),  pages  30-36.) 

Now  A  -  XX+  ,  B  -  XQ(XQ)+  ,  I  -  A  ,  I  -  B  are  also  sym¬ 
metric  idempotent  matrices.  It  is  easily  seen  that  AB  »  B  from 
which  it  follows  that  BA  =  B,  A  -  B  is  symmetric  idempotent  and 
(A  -  B ) ( I  -  A)  -0.  "Large  values"  of  SSH/SSE  correspond  to 

"large  values"  of  (SSH  -  SSE)/SSE  «  Y’(A  -  B)Y/Y'(I  -  A) Y .  The 

2 

quadratic  forms  in  this  numerator  and  denominator  ar^  a  multi¬ 
ples  of  independent  chiquare  random  variables  with  t  -  p(A  -  b) 
and  n  -  r  degrees  of  freedom  respectively.  The  non-centrality 
parameters  are  e'X’(A  -  B)X6  -  ©'X'(I  -  B)XB  and  0’X’(I  -  A)X6  - 

G  respectively.  Thus  F  *  Y ' ( A  -  B ) Y/t  +  Y  * ( I  ~  A)Y/(n~r)  will  be 
a  proper  F  random  variable  when  t  is  positive. 

The  import  of  the  non-centrality  parameter  is  discussed  in 
section  III;  the  rank  condition  is  examined  first. 

IIi_RANK_ IN_THE_FOUR_CASES .  The  basic  result  is  the 
THEOREM:  Given  X  n  by  p,  H  h  by  p,  let  Q  -  I  -  H*H,  A  »  XX+,  B  - 

XQ(XQ)+.  Then  t  -  p(A  -  B)  -  dim  R(H’)  n  R(X')  -  p(X’)  +  p(H')  - 

P(X'  , H ' ) . 

Proof:  Both  A  and  B  are  symmetric  idempotent  matirces  and  AB  -  B 
so  also  BA  -  B.  It  follows  that  A  -  B  is  symmetric  idempotent  so 
that  its  rank  is  equal  to  its  trace:  p( A  -  B)  -  tr(A  -  B)  - 
tr ( A)  -  tr (B )  -  p( A)  -  p(B)  -  p(X)  -  p(XQ).  As  part  of  their 
Theorem  1,  Bakaalary  and  Kala  (1976)  show  that  p(X)  -  p(XQ)  - 

dim  R(H')  n  R(X').  As  part  of  their  Theorem  2,  they  show  that 

P(XQ)  *  p(X'  , H ' )  -  p(H').  The  conclusion  follows  by  substitu¬ 
tion.  M 

Of  course,  this  is  merely  a  restatement  of  the  standard 
result  on  dimensions  of  subspaces  v1'v2:  +  V^)  -  dim  + 
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dim  -  dim  n  .  The  following  four  cases  can  occur;  only 
the  first  three  cases  have  been  recognized  previously. 

1.  R(H’)  n  R(X')  -  {0}  iff  t  -  0  iff  A  -  B.  This  is  the 

case  in  which  the  error  sums  of  squares  are  equal  and  there  is 

no  F  teat. 

2.  R(H')  n  R(X’)  -  R(X')  iff  R(X’)  c  R(H’)  iff  t  -  p(X)  iff 

p(XQ)  ■  0  iff  B  -  0  iff  X  «  XH+H.  Here  SSH  *  and  the  F  test 

is  equivalent  to  that  for  testing  X6  »  0  versus  XG  *  0.  Note 
that  p(X)  4  p(H) . 

3.  R(H’)  n  R(X ’ )  -  R(H’)  iff  R(H‘)  is  a  proper  subset  of 
R(X')  (the  equality  to  be  considered  in  2)  iff  t  -  P(H)  only  if 
H  *  HX+X.  This  is  the  usual  condition  that  H0  be  estimable  and 
the  F  test  is  the  common  one.  Note  that  p(H)  <  p(X)  .  In  par¬ 
ticular,  this  is  the  case  when  X  has  full  rank  p  as  in  ordi¬ 
nary  regression. 

4.  R(H')  n  R(X')  is  a  subspace  with  dimension  0  <  t  <  min 
(p(X) , p ( H ) }  as  in  the  example  below.  Within  the  context  of  sec¬ 
tion  III,  the  F  test  is  valid. 

EXAMPLE  Factor  one  has  2  levels  and  factor  two  has  3 
levels;  there  are  k  observations  in  each  cell  and  only  main 
effects  are  to  be  considered.  Let  d  -  (# ,  ,  0  0^  ,  /3^  )  '  and 

110100] 

110  0  10 

110001  fl  10000 

M  =  10  110  0  H-  000010 

101010  (o  00001, 

1  0  J  0  0  1, 

Then  X  *  (1^  «  l^JM  “  M  ®  1^  where  0  denotes  the  Kronecker 
product  and  1^  is  an  k  by  1  vector  of  ail  ones. 

'6  3  3  2  2  21  2/3  -1/3  0  -1/2  -1/2  0 

330111  -1/3  2/3  00  00 

303111  0  000  00 

X’X  -  k  211200  (X ' X) +  -  £  -1/2  00  1  1/20 

[211020  -1/2  0  0  1/2  1  0 

(2  I  1  0  0  2  j  ,0  00  0  0  0 
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It  is  easily  seen  that  H  **  HX+X  .  But  columns  1  and  2  of  H' 

sum  to  column  2  of  M*  and  columns  1  and  3  of  H'  sum  to  column  3 

of  M' .  There  are  no  other  such  linear  dependencies  so  in  this 

case,  dim  R(H’)  n  R(X')  -  2  <  3  -  p(H’)  <  4  -  p(X')  .  f 

On  page  194,  Searle  (1971)  says  that  when  H  a  HX+X  ,  the 

rows  of  H  and  X’X  are  linearly  independent.  The  example  shows 

clearly  that  this  is  not  true  in  general:  the  difference  of  tho 

last  two  rows  of  X'X  is  equal  to  2k  times  the  difference  of  the 

last  two  rows  of  H.  In  a  subsequent  publication,  Searle  (1973) 

has  discussed  other  errors  and  their  corrections. 

The  usual  manipulations  (as  appear  for  example  in  Rao  and 

Mitra  (1971)  chapter  7)  may  be  used  to  find  the  rank  conditions 

when  Hfl  -  h  i*  0,  when  there  are  also  constraints  G8  -  g  con- 

2 

sistent  with  He  -  h  and  when  the  covariance  matrix  is  a  V  with  V 
known.  Such  details  are  given  in  Rogers  and  Urquhart  (1980). 

•  Note  that,  e ' x '  ( i  -  B)xe  -  o 

iff  (I  -  B)xe  -  0  iff  xe  c  N ( I  -  B)  -  R(XQ).  It  is  obvious  from 

the  geometry  that  the  generalized  likelihood  ratio  procedure 

will  lead  to  this  same  F  whenever  a  hypothesis  projects  X8 

into  R(XQ)  whether  or  not  He  is  estimable.  Therefore,  an  anomaly 

of  P  is  that  it  is  a  basis  for  a  test  of  not  one  but  many 

different  hypotheses  all  of  which  lead  to  the  same  effective 

•  X®  €  R (XQ)  .  (Actually,  it  is  not  necessary  that  H 

have  full  row  rank  but  only  that  p(H)  <  p.) 

There  is  in  fact  an  estimable  one  of  these  hypotheses  as  is 

implicit  in  results  of  Scheffe  (1959,  page  34).  Let  T*  be  an  n 

by  t  matrix  whose  columns  generate  the  orthogonal  complement  of 

R(XQ)  in  R ( X ) .  Since  TXQ  -  0,  TX8  -  0  when  H9  -  0.  But  TX9  -  0 

implies  xe  e  N(T)  which  is  the  orthogonal  complement  of  R(T’); 

thus  xe  c  R(XQ).  Therefore,  SSH  is  the  minimum  when  TXe  -  0  of 
2 

•Y  -  xe«  and  the  same  F  is  obtained.  Obviously,  TX9  is  an 
estimable  function.  Since  d'TX  -  0  iff  T'd  is  orthogonal  to 
R(X) ,  d'TX  «  0  implies  T'd  -  0  which  in  turn  implies  d  -  0;  thus 
P  (TX )  -  p ( T)  -  t. 
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EXAMPLE  Pot  the  model  of  section  II  with  k  •  1,  X  -  M. 


HSC  (Hocking,  Speed  and  Coleman  (1980))  used  "effective 
model"  /"effective  constraint*  and  "effective  hypothesis"  in  a 
discussion  of  the  cell  means  model.  In  ozdet  to  see  the  equiv¬ 
alence  with  the  present  definiton  of  effective,  it  is  conven¬ 
ient  to  use  a  factorization  of  the  X  matrix  as  is  illustrated  in 
the  following 

EXAMPLE  Suppose  that  in  the  model  of  section  II,  observations 
are  available  only  for  cells  12  13  21  23.  The  corresponding 
incidence  matrix  is  010000 

0  0  1  0  0  0 

u  “  0  0  0  1  0  0  •  Let  S  be  a  block 

0  0  0  0  0  1, 

diagonal  matrix  with  "diagonal"  lkl  ,  lk2  ,  lk3  ,  lk4  representing 
(possibly)  different  numbers  of  replications  in  the  "observed" 
cells.  Then  the  new  X  ■  SUM.  For  the  cell  means  model,  v  ■  He 

and  X6  -  SUv  -  s(vi2' v13' v21'v23^  '  ‘  * 

In  general  for  a  cell  means  model,  EfY]  •  SUv  »  Sv  where 

o 

vQ  is  the  vector  of  means  of  cells  for  which  there  are  observa¬ 
tions;  say  v  is  q  by  1.  Index  the  cells  so  that  v  -  (v'  , v ’ ) ' 
where  vm  is  the  vector  of  means  of  cells  which  are  "missing"; 
say  vm  is  m  by  1  so  p  -  m  +  q.  Now  U  takes  the  form  (0  ,1  ).  For 
a  hypothesis  Gv  -  0  (G  being  g  by  p  of  rank  g),  without  loss  of 
generality,  one  may  use  tow  reductions  to  write  this  as 
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whore  0  has  maximal  rank.  The  effective  hypothec  is  of  HSC  ie 
o 

G  v  *  0  in  the  effect  v.-?  ,  lei  ¥  -  Sv^  +  e  .  By  the  techniques 
o  o  o 

of  the  present  paper,  .r.it  ie-.us  to  E[K]  e  R(SQ0)  with  Qq  -  I  - 

G+G  .  Por  Gv  -  0  in  ti>*»  »..•  ¥  -  SOv  +  e,  the  xelevant  sub- 

°  °  ♦ 
space  is  R(SUQ)  with  Q  -  I  -  00  . 

Since  G  has  full  row  rank,  0+  »  G’(GG')  Let  (GO')  1  toe 

partitioned  appropriately  with  "rows"  C  D  /  J  L.  It  turns  out 

that  UQ  has  "columns" 


-(G'  C  +  G'J)0  /  I  -  <G'  C  +  G'J)G  -  (G*  D  +  G'L)G„ 

'  mo^  o  1  mm  /  q  '  mo  o  mo  mo  o  o 

Then  from  GG+G  -  G  ,  one  gets  OqUQ  -  0  and  hence  QoUQ  -  UQ. 

Now  R(SUQ)  c  R(SQ0)  iff  SQo(SQo)~SUQ  -  SUQ  .  (See  Rao  and 
Mitra  (1971);  "-"  denotes  any  generalized  inverse.)  Since  S+  » 
(S'S) ~^S'  and  Qo  is  symmetric  idempotent,  one  form  of  (SQq)  i» 
0QS+:  S0o0oS+SQo  -  SQo0 0Q0  -  SQq  .  Hence,  R(SUQ)  c  R(SQo)  iff 
SQ0UQ  »  SUQ  iff  QqUQ  -  UQ  which  has  just  been  demonstrated.  But 
QqUQ  -  UQ  iff  R(UQ)  c  R(Q0)  since  Q*  -  Qq.  Now  p(UQ)  -  p(U’,G*) 

-  p(G')  -  pfO^)  ♦  q  -  P(0)  -  q  "  (P«»)  "  „))  •  q  -  P<V 

*  P(Q0)  so  that  these  subspaces  are  equal.  Thus  Y  •  SUv  +  e 

with  Gv  -  0  and  Y  -  Sv  with  G  v,  -  0  lead  to  projection  of  ElY) 

o  o  o 

into  the  same  subspace  and  consequently  have  the  same  SSH  with 


degrees  of  freedom  n  -  q  +  p(Go). 

When  Gv  -  0  represents  a  constraint  rather  than  a  hypoth¬ 
esis,  Y ' ( I  -  SUQ (SUQ)  +  ) Y  -  Y  * ( 1  -  SQo(SQo)+)Y  -  SSE  rather 

than  SSH.  Then  in  the  method  of  HSC,  C„„v  +  G„v_  -  0  is 

mm  m  mo  o 

used  to  eliminate  all  or  part  of  vm  from  the  hypothesis  Hv  - 

0.  Suppose  that  this  produces,  +  Hmovo  “ 

reductions  are  used  to  write  (H  ,  ,H  )  as  H 

mi  mo 

l 

with  H  of  maximal  rank;  their  effective  hypothesis  is  H  v  - 
o  o  o 

0.  As  above  (with  messier  details),  SSH  -  Y'(I  -  SUQ(SUQ)+)Y  - 

¥ ' ( I  -  SQ0(SQ0)+)Y  for  Q  -  Q'  -  I  -  (O’  ,H')(G’  ,H')  +  and  Qq  - 

Q'  -  I  -  (O'  ,H')(G'  ,H')+  .  Here  the  degrees  of  freedom  is  n  - 
o  o  o  o 

q  ♦  P(G^  ,H^)  and  t  -  p(G^  ,H^)  -  p(O^)  . 


0;  then  row 
*  * 

ml  .no 
0  Ho 


XV ^REMARKS  .  Por  the  balance')  model  considered  previously 


with  no  missing  cells,  let  H0  -  (0  ,  I^)©  ■  (al,a2'^l'^2'^3^ ' ' 

Then,  Q  -  I  -  H+H  has  rows  10/00  and  XQ  -  (ln  ,0)  where  the 

zeroes  denote  zero  matrices  of  appropriate  dimensions.  Now  I  - 

XQ(XQ)*  -  I  -  1  l'/n  has  rank  n  -  1,  n  -  6k,  and  p(X)  -  4.  The 
n  n 

numerator  in  the  P  test  for  He  -  0  has  degrees  of  freedom  (n  - 
1)  -  (n  -  4)  -  3  -  t. 

In  fact,  for  any  X  n  by  p  with  first  column  ln,  and  H  -  (0  , 

I  _1),  H®  •  0  is  testable  in  the  same  way  though  H©  need  not  be 
estimable;  R(XQ)  -  R(ln  ,0)  -  R(ln). 

In  analysis  of  variance,  the  columns  of  X'  represent  cell 
means:  t  will  be  positive  iff  the  hypothesis  He  -  0  contains 
Implicitly  the  hypothesio  that  (at  least)  one  linear  combi¬ 
nation  of  means  of  cells  for  which  there  are  observations  is 
zero.  An  example  is  the  TX  in  section  III  which  yields 

v12  *  v13  “  0  and  V22  "  v23  “  0 

Rao  and  Mitra  (1971)  show  that  varying  W  arbitrarily  in  H 

+  W  -  H+HWHH+  generates  all  generalized  inverses  H".  Then,  Qft  - 

I  -  HH  -  (I  -  H+H)(I  -  WH)  say  QW  .  The  results  in  Baksalary 

and  Kala  (1976)  are  proved  actually  for  H  so  p(XQ  )  -  p(X’  ,H') 

-  p(H')  -  p (XQ)  and  R(XQ  )  ■  R(XQ) .  The  use  of  usually 

A 

simplifies  the  calculations. 

Note  that  the  ranks  may  be  calculated  at  any  convenient 
stage  in  p(X)  -  p(X'X)  -  p(XX+)  -  tr(XX+)  and  similiarly  for 
p (H) ,  p(X*  , H * ) .  Hemmerle  (1979)  discusses  computer  aspects. 

This  chore  can  be  further  simplified  by  f actor izat ion  of  X  as 
indicated  above.  The  new  X  -  SUM  and  p(X)  -p(X'X)  - 
p(M'U'S'SUM)  -  p(M'U'UM)  -  p(UM)  since  S'S  is  positive  defi¬ 
nite.  Similarly,  for  any  Q,  p(XQ)  -  p(UMQ) ,  etc.  When  there  are 
no  missing  cells,  U  is  an  identity  matrix. 

The  author  extends  his  thanks  to  Mary  Ann  Maher,  N.  Scott 
Urquhart  and  David  V.  Hinkley  for  stimulating  discussions  of 
this  topic  which  consequently  appears  in  a  different  form  than 
that  presented  at  the  Conference. 


333 


REFERENCES 

[1]  Albert,  A.  (1972)  Regression  and  the  Moor e-Penrose 
Pseudo Inverse,  Academic  Press,  New  York. 

[2]  Saksalary , J . K .  and  Kala,R.  (1976)  Extensions  of  Milliken's 
estimability  criterion,  Annals  of  Statistics  4  639-641. 

[3]  Hemmerle,  W.J.  (1979)  Balanced  hypotheses  and  unbalanced 
data,  J.  Airier ^  Statist^  Assoc ^  74  794-798. 

[4]  Hocking,  R.R.,  Speed,  F.M.  and  Coleman,  A.T.  (1980)  Hy¬ 
potheses  to  be  tested  with  unbalanced  data,  Commun^  Statist L 
Theor-Meth  A9(2)  117-129. 

[5]  Rao ,  C .  R .  and  Mitr a , S . K .  (1971)  Generalized  Inverse  of  Matrices 
and  its  Applications,  Wiley  and  Sons,  New  York. 

[6]  Rogers,  G.S.  and  Urquhart,  N.S.  (1980)  Testability  of 
linear  hypotheses  in  normal  linear  models,  Tech^  Rpt^  no:  36 
NMSU  Statist.  Lab.,  Las  Cruces,  N.M. 

f 

[7]  Scheffe,H.  (1959)  The  Analysis  of  Variance,  Wiley  and 
Sons,  New  York. 

[8]  Searle,  S.R.  (1971)  Linear  Models,  Wiley  and  Sons,  New 
York. 

[9]  Searle,  S.R.  (1973)  Testing  non-testable  hypotheses  xn 
linear  models:  corrections.  Biometrics  Unit,  Cornell  University 
Ithaca,  N.Y. 


334 


THE  POTENTIAL  UTILITY  Of  CROSSING  A  FRACTIONAL 
FACTORIAL  WITH  A  FULL  FACTORIAL  IN  THE  DESIGN  OF  FIELD  TESTS 


Carl  T.  Russell’ 

US  Army  Cold  Regions  Test  Center 
Fort  Greely,  Alaska 

ABSTRACT .  Typically  in  the  design  of  a  field  test,  there  are  more 
factors  than  can  be  completely  tested  (in  a  full  factorial)  within  resource 
constraints.  Usually  one  or  two  "primary"  factors  and  a  few  "secondary" 
factors  are  selected  as  design  variables  and  all  other  factors  of  possible 
interest  are  either  fixed  or  ignored.  Traditionally,  "replicates"  of 
several  overlapping  full  factorials  in  the  primary  factor  and  one  or  two 
secondary  factors  at  a  time  are  conducted,  yielding  a  design  made  up  of  many 
side  tests  and  sensitivity  tests.  This  paper  presents  an  alternative  to  the 
traditional  design  approach  in  terms  of  an  example  which  calls  for  con¬ 
ducting  full  factorials  in  the  primary  factor  with  combinations  of  levels  of 
secondary  factors  specified  by  a  fractional  factorial  in  the  secondary 
factors.  Potential  pragmatic  and  analytical  advantages  and  disadvantages  of 
such  designs  are  discussed  with  emphasis  on  design  flexibility  and  inherent 
blocking  schemes. 

I: _ 1_N1  ROUUtnON.  Within  the  Army  testing  community,  a  test  design 

for  a  field  test  is  usually  regarded  as  synonomous  with  a  test  matrix,  that 
is,  a  matrix  identifying  combinations  of  controlled  test  conditions  and  the 
number  of  times  each  combination  is  to  be  tested.  As  a  result,  both  re¬ 
source  managers  and  test  operators  tend  to  become  infatuated  with  sample 
size:  the  resource  manager  often  bases  judgements  primarily  on  the  "number 
of  replications"  while  the  test  operator  concentrates  primarily  on  obtaining 
the  "requisite  sample  size"  in  each  cell  as  efficiently  as  possible.  I 
think  that  as  statisticians,  we  should  find  ways  to  deemphasize  such  summary 
matrices  and  emphasize  test  structure  in  a  manner  marketable  to  the  testing 
community. 

The  structure  1  refer  to  is  blocking.  In  my  experience,  all  field 
tests  are  conducted  in  blocks  of  time  or  space.  Block  sizes  are  usually 
dictated  by  resource  constraints,  but  block  contents  are  most  often  dictated 
by  expedient  completion  of  the  "requisite  sample  size,"  leading  to  extensive 
confounding  of  possible  block  effects  with  factor  effects  of  interest.  The 
standard  request  that  order  of  trials  be  randomized  (as  much  as  possible 
within  test  constraints)  over  the  entire  test  matrix  does  not  give  the  test 
operator  the  usable  systematic  statistical  advice  he  should  have  for  test 
conduct.  Consequently,  extensive  randomization  seldom  occurs  in  test  con¬ 
duct,  a  fact  usually  ignored  during  data  analysis. 


^Much  of  the  work  underlying  this  paper  was  done  while  the  author  was 
a  Research  Staff  Member,  Systems  Evaluation  Division,  Institute  for  Defense 
Analyses,  Arlington,  Virginia. 
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]  believe  that  for  most  field  tests,  the  resource  structure  suggests  a 
natural  structure  which  can  be  exploited  by  the  test  designer  to  produce  a 
fairly  small  basic  test  matrix  addressing  test  factors  of  interest.  This 
matrix  could  then  be  conducted  in  blocks  with  both  block  order  and  order  of 
trials  in  blocks  randomized  as  much  as  possible  within  test  constraints. 
(Notice  that  this  requested  randomization  is  fairly  small-scale  and  should 
fit  naturally  into  test  conduct,  hopefully  yielding  substantial  actual 
randomization  and  hopefully  precluding  gross  damage  in  case  of  non- 
randomization.)  The  basic  matrix  would  then  be  repeated,  possibly  with  some 
factors  or  factor  levels  deliberately  changed  in  sensitivity  tests.  This 
intuitively  simple  "basic  matrix  approach"  can  yield  highly  structured 
designs  which  not  only  are  executable  but  also  permit  refined  analysis. 

After  discussing  the  traditional  appoach  to  field  test  design  in  the 
special  case  where  the  goal  is  to  compare  levels  of  a  primary  factor  in  the 
presence  of  several  secondary  factors,  this  paper  applies  the  basic  matrix 
approach  to  a  particular  test  resource  structure  involving  primary  and 
secondary  factors.  The  resulting  example  illustrates  the  flexibility  and 
potential  analytic  richness  inherent  in  the  basic  matrix  approach. 

11.  1  HI  TRADITIONAL  DESIGN  APPROACH.  Often  in  field  testing  there  is 
a  control  Table  factor  of  primary  T  nterest2  (say,  G,  at  four  levels)  and 
there  are  several  controllable  factors  of  secondary  interest  (  say,  A,  8,  C, 
and  D,  each  at  two  levels).  The  traditional  approach  then  evolves  a  design 
along  the  following  lines.  Initially  it  is  determined  that  a  sample  size  of 
20  trials  per  cell  is  desirable  based  on  the  "rule  of  fingers  and  toes."3  A 
2*  x  4  full  factorial  with  20  "replications  per  cell,  however,  requires  1280 
trials,  so  the  designer  shrinks  the  number  of  cells  by  selectively  deleting 
cells  to  obtain  a  test  design  with  20  "replications"  per  cell  made  up  of 
three  overlapping  22  x  4  full  factorials,  AxBxG,  AxCxG,  and 
B  x  0  x  G  (the  suppressed  factors  are  fixed  at  their  low  levels).  This 
yields  a  matrix  of  640  trials  which  appears  in  the  approved  Test  Design 
Plan.  By  test  date,  howpver,  Hiylier  Headquarters  has  determined  that  re¬ 
source  constraints  will  permit  only  320  trials  and  that  two  other  factors 
(previously  considered  to  be  fixed)  should  be  varied  in  sensitivity  tests. 
Adding  two  new  factors  E  and  F  and  redistributing  the  shrunken  sample  size 
yields  the  test  matrix  of  Table  1,  which  is  fairly  typical  of  a  design  for  a 
large  f ield  test. 


The  TTTus trations  in  this  paper  deal  with  just  one  primary  factor  at 
several  levels.  An  ohvious  generalization  regards  these  levels  as  combina¬ 
tions  of  levels  of  two  or  more  factors  making  up  a  full  factorial  In  those 
factors. 

3  This  rule  states  that  anything  likely  to  be  operationally  significant 
is  likely  to  show  up  in  samples  countable  on  fingers  and  toes  and  vice- 
versa. 
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TABLE  1.  A  Typical  Test  Matrix  Arising  From 
the  Traditional  Design  Approach 


Secondary 

Factors 

Sensitivity 

Tests 

Primary 

Factor(G) 

1 

3 

D 

H 

E 

m 

9i 

92 

93 

94 

a.i 

D 

Cl 

di 

*i 

wm 

9 

9 

9 

9 

■ 

IB 

4 

B 

B 

B 

1 

*2 

U 

m 

B 

B 

B 

1 

d2 

®1 

wm 
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9 

9 

n 
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e2 

fi 

4 

B 

B 

B 

I 

C2 

KB 

■a 

IB 

9 

9 

9 

9 

g 

Cl 

KB 

■0 

IB 

9 

9 

9 

9 

■ 

b 

■0 

IB 

9 

9 

9 

9 

a2 

D 

D 

KB 

BB 

f. 

9 

9 

9 

9 

■ 

c2 

di 

ei 

9 

9 

9 

9 

I _ 

m 

1 

in 

*i 

U 

9 

9 

Bi 

Table  entries  are  the  number  of  trials  to  be  conducted  under 
each  test  condition.  The  order  In  which  trials  are  conducted 
is  to  be  randomized  as  much  as  possible  within  test  constraints. 


Ihis  design  still  contains  the  three  overlapping  22  x  4  full  factorials 
referred  to  earlier  (now  with  only  9  "replications"  per  cell);  these  sub¬ 
designs  examine  the  effects  of  A,  B,  C,  and  D  and  their  possible  inter¬ 
actions  with  G  in  "side  tests,"  each  involving  45  percent  of  all  test 
trials.  In  addition,  the  design  contains  two  "sensitivity  tests,"  each 
involving  10  percent  of  all  test  trials:  the  2x4  design  F  x  G  (with  4 
"replications"  per  cell4  and  with  all  other  factors  fixed  at  low  levels)  and 
the  ?3  design  0  x  E  x  G*  (with  4  "replications"  per  cell4,  with  factor  G 
restricted  to  two  levels,  and  with  all  other  factors  fixed  at  low  levels). 
Analysis  would  typically  consist  of  analyzing  data  from  each  subdesign 
separately,  assuming  randomization  was  complete. 


'  Actually  these  subdesiqns  call  for  either  4  or  9  trials  per  cell  and 
would  probably  be  analyzed  as  unbalanced  designs;  in  spirit,  however,  they 
have  4  "replications"  per  cell. 
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The  traditional  design  approach  has  two  main  practical  advantages. 
First,  it  is  accepted:  it  is  at  heart  a  simple  "vary-one-factor-at-a-time" 
approach  to  experimentation,  and  it  is  time-proven.  Second,  it  is  flexible: 
pre-test  reduction  of  resources  or  addition  of  factors  can  easily  be  accomo¬ 
dated  by  re-distributing  the  available  sample  size,  and  the  lack  of  detailed 
structure  hides  damage  due  to  data  loss  during  test  execution. 

However,  the  traditional  design  approach  has  three  great  statistical 
disadvantages.  The  first  two  disadvantages  are  technical:  the  resulting 
designs  are  inefficient  and  give  main  effects  biased  by  (non-estimable) 
2-factor  interactions.  The  third  and  most  important  disadvantage  Is  prag¬ 
matic:  by  emphasizing  sample  size  rather  than  design  structure  the  tradi¬ 
tional  approach  permits  sample  size  to  be  whittled  down  cel  1-by-cel  1 ,  and  it 
falls to  give'  systematic  statistical  guidance  for  detailed  test  conduct 
which  "might  preclude  day-to-day  test  schedul i no  ^rom  totally  dominating 
randomization  and  which  could  provide  a  solid  base  for  accurate  statistical 
inference, 


III.  REQUIREMENTS  FOR  AN  ALTERNATIVE  DESIGN  APPROACH.  To  be  marketable 
to  the  testing  community  (and  executable  in  the  field)  any  alternative  to 
the  traditional  design  approach 

•  must  be  Intuitively  (if  not  analytically)  simple 

•  must  conscientiously  consider  problems  of  test  scheduling  and 
control 

»  must  permit  insertion  of  meaningful  sensitivity  tests 

•  must  degrade  gracefully  in  the  race  of  resource  reductions  or 
substantial  data  loss 

In  actuality,  the  traditional  design  approach  meets  only  the  first  of 
these  requirements  well.  The  complete  randomization  prescribed  by  the 
traditional  approach  does  not  consider  problems  of  test  scheduling  and 
control,  and  it  is  seldom  well-followed  during  test  execution.  Sensitivity 
tests  frequently  involve  too  few  trials  for  conclusive  results,  and  many  of 
those  trials  are  not  usable  for  any  other  purpose.  Furthermore,  reductions 
in  sample  size,  through  either  resource  reductions  or  data  loss,  are  accom¬ 
modated  easily  only  because  traditional  designs  lack  detailed  structure 
which  could  illuminate  the  consequences  of  sample  size  reductions  In  terms 
other  than  square  root  of  sample  size  ratio. 

The  example  given  now  shows  that  a  design  generated  by  the  basic  matrix 
approach  can  meet  these  requirements. 

VI.  AN  EXAMPLE  BASED  ON  CROSSING  A  FRACTIONAL  FACTORIAL  WITH  A  FULL 
F AC The  basic  matrix  approach  was  implemented,  as  summarized  in 
Figure  T,  in  a  design  proposed  for  a  test  with  the  goal  of  comparing  5 
levels  of  a  primary  factor  under  a  variety  of  operational  conditions,  some 
of  which  i.ould  be  specified  as  combinations  of  secondary  factor  levels. 
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The  resource  structure  for  the  test  was  as  follows.  There  were  three 
six-week  test  periods,  between  which  substantial  variations  in  equipment, 
personnel,  and  SOP's  were  expected.  Each  test  period  consisted  of  three 
two-week  test  segments,  between  which  personnel  were  expected  to  change  but 
equipment  and  SOp's  were  not.  It  was  suspected  that  crew  learning  might  be 
substantial  within  test  segment,  and  It  was  anticipated  that  trials  could  be 
conducted  at  the  rate  of  4  trials  per  day  for  10  days  per  test  segment, 
yielding  40  trials  per  test  segment.  Since  there  were  5  levels  of  the 
primary  factor  and  40  trials  per  test  segment  appeared  to  be  feasible,  it 
was  natural  to  define  a  basic  matrix  calling  for  all  five  levels  of  the 
primary  factor  to  be  tested  under  8  combinations  of  secondary  factor  levels 
during  each  test  segment.  Since  8  =  23  =  %  x  2A ,  either  3  or  possibly  4 
secondary  factors,  each  at  2  levels,  could  be  accommodated  in  the  basic 
matrix.  Conveniently,  there  are  half  replicates  of  the  2 4  design  which  are 
of  resolution  IV  (that  is,  all  four  main  effects  are  estimable  free  from 
2-factor  interactions  provided  3-factor  and  higher-order  interactions  are 
suppressed)  furthermore,  these  half-replicates  are  fold-over  designs  (they 
can  be  run  in  blocks  of  size  2  without  losing  the  resolution  IV  property). 
One  of  those  half  replicates  was  therefore  chosen  to  define  the  8  combi¬ 
nations  of  secondary  factor  levels  for  the  basic  test  matrix  which  appears 
as  Table  2. 


TABLE  2.  The  Basic  Test  Matrix 
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C2 

d2 

cr 

a 

a 

Or 

The  basic  matrix  is  to  be  run  in  four  blocks  (a,  p,  y,  6) 
of  ten  trials  each.  The  order  in  which  the  four  blocks  are 
run  should  be  chosen  at  random  each  time  the  matrix  is  run, 
and  the  order  in  which  trials  are  conducted  within  blocks 
should  be  randomized  as  much  as  possible  within  test  con¬ 
straints. 
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The  blocking  scheme  in  Table  2,  together  with  its  associated  randomi¬ 
zation  scheme,  was  chosen  to  exploit  the  fold-over  property  of  the  frac¬ 
tional  factorial  while  providing  an  executable  design  and  a  randomized  block 
structure  on  the  primary  factor.  Within  the  basic  matrix,  main  effects  of 
both  primary  and  secondary  factors,  as  well  as  primary-secondary  2-factor 
interactions,  are  estimable  free  from  blocks  and  secondary- secondary  2- 
factor  interactions.  Secondary- secondary  two  factor  interactions  are  con¬ 
founded  with  each  other  in  pairs  and  also  confounded  with  blocks.  The 
randomization  scheme  does  not  severely  constrain  the  test  operator;  instead 
the  randomization  scheme  together  with  the  blocking  scheme  provide  the  test 
operator  with  the  sort  of  systematic  statistical  guidance  he  should  have  to 
obtain  a  statistically  defensible  data  set.  Because  block  order  is  ran¬ 
domized  (this  should  be  entirely  executable  because  of  similar  structure 
within  different  blocks),  some  inference  regarding  order  effects  (possible 
“crew  learning'1}  might  be  possible  even  though  such  order  effects  would  be 
confounded  in  a  complicated  random  way  with  suppressed  2- factor  interactions 
between  secondary  factors.  The  requested  randomization  of  trials  within 
blocks  could  probably  not  be  complete  because  it  in  unlikely  that  all  of  the 
changes  in  factor  levels  between  an  arbitrary  pair  of  trials  would  be  prac¬ 
tical.  All  trials  within  a  block,  however,  could  reasonably  be  conducted 
within  a  few  days--which  would  give  each  block  reasonable  analytic  inter- 
prelaliuir-and  even  partial  randomization  should  preclude  systematic  within 
block  bias.  Moreover,  each  original  block  could  be  regarded  as  two  blocks 
in  the  levels  of  the  primary  factor,  each  defined  by  one  combination  of 
secondary  factor  levels.  Thus  the  blocking  and  randomization  schemes  make 
the  core  of  the  design  a  very  simple  randomized  complete  blocks  design  in 
the  levels  of  the  factor  of  primary  interest. 

Several  assumptions  are  made  in  what  follows.  First,  it  will  be 
assumed  that  3-factor  and  higher-order  interactions  among  primary  and 
secondary  factors  are  zero.  For  convenience,  possible  2-factor  secondary¬ 
secondary  interactions  will  also  be  suppressed  (which  ignores  a  possible 
source  of  bias  in  the  block  effects  discussed).  It  will  also  be  assumed 
that  the  error  terms,  c,  are  uncorrelated  random  variables  with  mean  zero 
and  the  same  variance.  Although  least  squares  estimation  is  implicit  in  the 
discussion,  linear  models  are  discussed  more  to  illuminate  the  potential 
stucture  of  data  sets  produced  by  basic  matrix  designs  than  to  provide 
specific  analytic  methods. 


A. 


The  Initial  Model 
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Model  I: 


Y 

i  jkmnpqs 

Side  Conditions 


P  +  g.  +  a.  +  (ga). .  ♦  bR  ♦  (gb),k  -  cm  -  (gc)iffl  -  *n 
4  ^d^in  4  ^pqs  4  ci jkmnpqs 


(1) 


0  =  9  =  a>  =  b.  -  c.  =  d.  =  (ga)i#  =  (ga). j  =  ( gb ) i .  =  (gb). k 

=  <<ic)..  =  (gc).m  =  (gd) - .  =  (gd).n  =  p...  =  0  (2) 

B.  Modifications  and  Extensions  of  the  initial  Model.  Although  Model 
1  has  60  i ndependent  parameters ,  illustrating  the  rich  analytic  structure 
potentially  available  in  a  data  set  generated  from  the  basic  matrix  ap¬ 
proach,  this  model  can  be  modified  by  reinterpreting  existing  parameters  and 
can  be  extended  by  adding  new  parameters. 

The  most  obvious  modification  of  the  initial  model  is  a  reinterpre¬ 
tation  of  the  block  effects.  The  q^h  block  in  the  s^  segment  of  the  pth 
test  period  contains  interblock  information  on  potential  effects  of  period, 
np.  segment,  within  period,  <jips,  and  order  or  "crew  learning,"  K  ♦  (nA)pq 

1  (-W)  ■  that  is,  the  35  independent  block  effects,  B  „  ,  can  be  reinter- 

qps  pqs 

preted  as 

=  "p  4  V  +  Aq  4  (nA)pq  +  (^}qps’  (3) 


—  *  V  =  K  =  =  (^).q  *  (^)qp. 

=(**).ps=0.  (4) 

In  addition,  the  whole  basic  matrix  is  crossed  with  test  segments  and 
test  periods,  so  all  interactions  of  test  segment  and  test  period  with 
primary  and  secondary  factors  are  estimable  and  can  be  incorporated  into  the 
model  as  desired.  In  particular,  the  fact  that  rather  large  changes  in  test 
conditions  were  expected  from  test  period  to  test  period  suggests  that 
possible  interactions  of  test  period  with  primary  and  secondary  factors  be 
incorporated  into  the  model.  This  is  done  in  Model  II,  which  also  allows, 
as  an  example,  possible  interactions  of  test  segment  with  factors  G  and  A. 
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Vijkmnpqs  =  “  4  "p  4  V  *  9'  *  (9"}ip  *  (W)ips  *  *j  *  <M)jP 

4  <s|i’>jpS  4  (9a>ij  4  <9an)(jp  *  (9»*),jps  4  bk  4  0*")kp 

4  <«b>ik  4  <9b">ikp  4  c„  4  «=">mp  4  (9C>im  ♦  (9C")i„p 
4  °n  4  (d">pp  4  4  <9<f>inp  4  %  4  («')pq  4  <)‘+)qps 

^ijl.mnpqs 

Side  Conditions  in  Addition  to  (2)  and  (4) 

(i  ^  (gn)..  =  (gn).p  =  (g»j>)ip.  =  (g<l').ps  =  (an)j.  =  (an)-p  =  (aa,)jp- 
=  (ay,).pS  =  (9an>ij-  =  (9an)..p  =  (gan).jp  =  (gaaOijp.  =  .ps 

--  (gau0.jps  -  (b n)k.  *  (bn).p  =  (gbn)^.  =  (gbn)..p  =  (gbn).kp 


=■  (cn)nr  s  (cn)-p  a  (9cn)im-  =  <Ocn>i-p  =  (9cn)-mp  =  (dT°n- 
-  (dn).p  (gdn)^.  =  (9dn)i>p  =  (gdn).np  =  0  (6) 

C.  Inserting  Sensitivity  Tests.  The  102  independent  parameters  added 
in  passing  from  Model  I  to  Model  11  potentially  enable  stronger  inferences 
by  explicitly  accounting  for  interaction  effects  of  nuisance  parameters. 
They  also  enable  the  insertion  of  sensitivity  tests  by  allowing  certain 
factors  to  change  from  period-to-period  or  segment-to-segment. 


The  simplest  way  to  introduce  a  sensitivity  test  is  to  replace  one  of 
the  secondary  factors  in  the  basic  matrix  by  other  factors  for  one  or  more 
test  periods.  For  instance,  factor  B  could  be  replaced  by  three  factors  B j , 
B,. .  and  B, ,  each  at  2  levels  and  each  varied  during  one  test  period  (and 
fixed  «Jii r i ny  the  other  two  periods).  Then  Bp  would  be  nested  in  test  period 

p,  and  its  main  effect,  (b  ),  ,  as  well  as  its  interactions  with  the  primary 

P  * 

factor,  (gbp)-k,  would  have  meaning  only  within  period  p.  The  15  indepen¬ 
dent  Model  11  parameters  involving  factor  B  would  be  replaced  by  the  15 
independent  parameters  (bp)k  and  (gbp)lk  with  bp#  =  (gbp).^  =  0.  Adding  a 

sensitivity  test  in  this  manner  does  not  impact  the  precision  or  bias  of  the 
estimators  of  any  primary  or  secondary  effects  except  those  of  B.  However, 
the  estimator  of  each  estimable  effect  involving  B  is  based  on  only  one 
third  the  observations  for  corresponding  estimators  ^involving  other  secon¬ 
dary  factors,  and  each  is  biased  by  possible  2-factor  interactions  with  test 
period  (which  are  nonestimable) . 


Alternatively,  a  factor  could  be  replaced  for  one  or  more  test  segments 
in  each  test  period,  but  such  a  situation  will  not  be  discussed  here. 
Instead,  a  different  method  for  adding  a  sensitivity  test  is  considered. 
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Suppose,  for  example,  that  factor  A  represented  two  types  of  jamming  and  a 
sensitivity  test  were  desired  to  investigate  whether  different  methods  of 
employment  of  each  type  jamming  had  any  substantial  effects.  By  selecting 
three  different  methods  of  employment  and  applying  each  during  one  randomly 
selected  test  segment  of  each  test  period,  the  desired  sensitivity  test 
could  be  accommodated  in  the  design  and  in  the  model.  This  essentially 
confounds  a  new  factor,  E,  with  test  segment.  The  six  independent  para¬ 
meters  4»ps,  each  reinterperted  as  being  the  effect  of  the  segment  of  period 

p  for  which  factor  E  is  at  level  s,  lead  to  six  independent  parameters 
es  =  «l).s  and  (ne)ps  =  «jjps  -  e$  with  e.  =  {n«)p.s  =  0-  In  addition,  the  54 

independent  interaction  parameters  (94*)^ps,  (a4»)jps>  and  (g«|0-jps 

54  independent  parameters  (not  all  of  which  are  necessarily  worth  enter¬ 
taining  in  a  model)  describing  possible  differing  effects  of  jamming  method 
between  jamming  types  and  between  levels  of  the  primary  factor: 

(ge)ls  =  Cg4»)i>s,  (gne)ips  -  <g*)ips  -  <ge)is. 

(ae)js  =  <*»j.S*  (ane)jps  =  <*>jps  ‘  (ae)js’ 

(gae)ijs  =  (ga^)- j>s,  (gane)ijps  =  <ga*)ijps  -  (gae)ijs-  (6) 

When  a  sensitivity  test  is  inserted  in  this  manner,  none  of  the  estima¬ 
tors  of  the  original  primary  or  secondary  factor  effects  are  directly  im¬ 
pacted.  As  when  changing  a  factor  from  period-to-period,  however,  estima¬ 
tors  of  effects  involving  the  sensitivity  factor  require  more  judgemental 
interpretation  than  those  involving  the  primary  and  secondary  factors  be¬ 
cause  of  confounding. 

Clearly,  several  sensitivitv  tests  could  be  inserted  into  the  same 
design  by  proceeding  along  the  lines  suggested  above.  Although  results  of 
such  sensitivity  tests  cannot  be  expected  to  be  as  clear  as  results  in¬ 
volving  factors  examined  in  each  basic  matrix,  they  can  involve  sufficient 
numbers  of  well  organized  trials  to  justify  their  conduct.  Moreover,  these 
sensitivity  tests  follow  the  spirit  of  efficient  statistical  design,  inter- 
•aving  new  fartors  by  confounding  them  with  interactions  of  old  factors  and 
reparameterizing  i ather  than  stealing  a  few  observations  from  the  existing 
design  and  conducting  a  tiny  excursion. 

b.  Graceful  Degradation.  Reduction  of  the  number  of  trials,  either 
before  or’  during  testing,  could  be  accommodated  by  the  design  in  simple  ways 
having  clear  consequences. 

One  easy  way  to  accommodate  pre-test  resource  reductions  would  be  to 
eliminate  test  periods  or  test  segments.  Termination  of  the  test  after  two 
test  periods,  instead  of  three,  would  have  only  a  slight  impact,  primarily 
that  of  sample  size  reduction.  Differences  between  the  two  test  periods 
could  still  be  estimated,  and  any  period-to-period  sensitivity  test  could 
still  be  implemented  on  a  reduced  basis.  Termination  of  the  test  after  one 
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test  period,  however,  would  preclude  period-to-period  sensitivity  testing 
and  force  the  assumption  that  differences  between  periods  would  not  have 
been  substantial.  Since  it  was  anticipated  that  differences  between  periods 
would  be  more  pronounced  than  differences  between  segments,  the  statistician 
could  state  that  a  test  having  three  test  periods  of  one  segment  each  would 
be  more  likely  to  yield  general  results  than  a  test  having  one  test  period 
of  three  test  segments.  Of  course,  reducing  either  the  number  of  test 
periods  or  the  number  of  test  segments  to  one  limits  both  the  number  and  the 
value  of  sensitivity  tests  which  could  be  accommodated. 

Another  way  to  accommodate  resource  reductions  would  be  to  reduce  the 
number  of  levels  considered  for  the  primary  factor.  This  could  be  done 
prior  to  test  or  it  might  be  necessary  during  test  execution  if  (as  fre¬ 
quently  occurs)  trials  could  not  be  conducted  as  quickly  as  originally 
anticipated  or  if  a  large  number  of  trials  resulted  in  unusable  data.  Since 
the  levels  of  a  primary  factor  often  represent  incremental  additons  to  some 
baseline  conditions,  they  could  often  be  prioritized  for  deletion,  and  the 
loss  of  one  or  two  levels  might  only  reduce  the  scope  of  potential  results. 

Unplanned  data  loss  on  a  large  scale  (as  much  as  1/3  to  2/3  loss)  is 
unfortunately  not  uncommon  during  a  field  test.  Designs  generated  by  the 
basic  matrix  approach  degrade  gracefully  in  the  presence  of  such  data  loss 
because  the  same  block  structure  which  permits  refined  data  analysis  with 
nearly  complete  data  sets  also  gives  a  solid  framework  for  salvaging  results 
from  a  degraded  data  set.  In  particular,  the  fact  that  the  core  of  the 
present  design  is  a  randomized  complete  blocks  design  in  the  levels  of  the 
primary  factor  assures  that,  even  with  substantial  data  loss,  comparisons 
between  levels  of  the  primary  factor  can  be  made  in  such  a  way  that  they  are 
at  least  free  from  main  effects  of  test  period  and  test  segment.  At  worst, 
differences  between  two  primary  factor  levels  could  be  examined  (using 
either  parametric  or  nonparametric  techniques)  in  all  blocks  (modified  by 
restriction  to  one  combination  of  secondary  factor  levels)  having  usable 
trials  for  each  of  the  two  primary  factor  levels.  To  a  lesser  extent, 
secondary  factors  could  be  examined  within  the  remaining  block  structure 
too.  Each  block  in  the  original  design  could  be  regarded  as  up  to  five 
blocks,  one  defined  by  each  level  of  the  primary  factor  having  an  ooser- 
vation  at  both  combinations  of  secondary  factor  levels.  A  crude  analysis 
could  then  be  performed  using  all  (modified)  blocks  by  examining  differences 
between  levels  for  each  secondary  factor  separately  (ignoring  hopefully  weak 
confounding  of  secondary  factors);  the  apparent  effects  obtained  would  at 
least  be  free  from  main  effects  of  the  primary  factor  as  well  as  those  of 
Lest  period  and  test  segment.  Data  quality,  computer  facilities,  and  time 
permitting,  the  blocking  structure  could  probably  be  exploited  in  more 
elaborate  ways  to  sort  out  main  effects  of  secondary  factors  or  even  inter¬ 
actions  between  primary  and  secondary  factors. 

In  no  case  does  an  underlying  design  structure  which  permits  consider¬ 
ation  of  models  like  Model  I  and  Model  II  preclude  accommodation  of  resource 
reductions  or  hinder  data  analysis  in  degraded  modes.  Provided  the  design 
structure  is  based  on  the  resource  structure,  it  can  accommodate  resource 
reductions  gracefully,  if  not  without  consequences,  and  refined  structural 
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models  can  serve  as  guides  to  the  data  even  when  they  can  no  longer  be  fully 
exploited. 

v  summary  AND  CONCLUSION.  The  design  example  Illustrates  the  flex- 

ibilUy  ,nH*TyUc.l  rlc-HSHT  phi  eh  c,n  b.  0bU1n.d  by  W'*1"?  *"  ^ 
tlvelv  slmole  structural  approach  to  the  design  of  a  field  test,  sucn 
designs  enable  formal  consideration  of  a  large  number  of 
test  factors,  provide  systematic  statistical  guldanc 
conduct,  and  emphasize  test  structure  rather  than  sample  size. 

Although  such  designs  seem  to  have  no  statistical  dis 

characteristic  111  SS.KHS ' Si 

Sri  iss^y"n« 

makes  the  inevitable  shortcomings  in  test  conduct  more  obvio‘£;  “* 
l.the  traditional  approach  is  accepted  and  d«i ndt  force  detailed 

Statistical  planning  *  up=" ,  «£,««<  "j>  ^.t  M  sulce  and 

proposers"0!?*  such  designs  Bust  be  prepared  for  extended  argument  and  fre- 
quent  disappointment. 
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SOME  REMARKS  ON  CROSSOVER  EXPERIMENTS 
J.  ROBERT  BURGE 

WALTER  REED  ARMY  INSTITUTE  OF  RESEARCH 

Introduction. 

Frequently  experimental  observations  are  collected  at  different  times 
on  the  same  sampling  unit.  The  simplest  example  of  such  a  repeated-measure¬ 
ments  design  consists  of  the  administration  of  a  control  treatment  (a  placebo 
or  standard)  on  one  occasion  and  the  treatment  of  interest  on  another.  The 
paired  observation  case,  when  extended,  generates  the  general  setup  of  k 
responses  collected  on  the  same  experimental  unit  at  successive  times.  Designs 
in  which  several  treatments .are  applied  in  successive  periods  to  each  unit  in 
a  cyclic  sequence  are  known  as  change-over  designs.  These  designs  are  often 
exercised  when  units  are  highly  variable,  or  are  expensive  or  scarce. 

Change-over  designs  are  capable  of  providing  treatment  comparisons  of 
high  precision,  because  differences  between  units  can  be  disjoined  from  exper¬ 
imental  error.  This  advantage  a  obtained  at  the  risk  of  possible  complications, 
for  performance  in  a  given  period  might  reflect  more  than  the  direct  effect  of 
the  current  treatment.  These  additional  effects,  known  as  residual  effects, 
exist  if  preceding  treatment  effects  crossover  to  influence  responses  measured 
in  succeeding  periods. 

Cox  (1953)  advises  that  the  crossover  design  be  used  only  when  it  can 
be  assumed  that  residual  effects  are  negligible.  However,  when  the  residual 
effects  do  not  persist  for  more  than  one  period  after  application,  some  provi¬ 
sion  for  the  separation  of  direct  and  residual  effects  can  easily  be  made. 

Still,  it  is  most  useful  when  the  residual  effects  are  relatively  small. 
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Cage  1:  Three  Factor  Repeated  Measurements  Experiment 

Before  discussing  Che  details  of  a  crossover  analysis*  a  repeated 
measures  example  is  considered.  The  summary  of  results  presented  here 
resembles  the  pattern  adopted  for  split-plot  procedures.  While  an 
experimental  unit  (whole  plot),  in  a  spilt  plot  design,  is  subdivided 
physically,  an  experimental  unit  is  split  in  time  in  a  repeated  measures 
design.  That  is,  each  unit  receives  Beveral  treatments  in  successive 
time  periods.  It  is  assumed  in  this  first  example  that  performance  in 
a  given  period  reflects  only  the  direct  effect  of  the  current  treatment, 
a)  Numerical  Example. 

This  repeated  measures  analysis  will  be  demonstrated  using  artificial 
data  based  on  a  study  now  being  planned  at  Walter  Reed.  The  study  will 
be  used  to  compare  two  positioning  techniques  in  reducing  the  discomfort 
from  intramuscular  injection  in  the  dorsogluteal  site.  The  techniques 
involve  placing  patients  in  the  prone  position  with  either  hips  inter¬ 
nally  rotated  (method  A)  or  with  hips  externally  rotated  (method  B) . 

To  illustrate  the  meaning  of  the  data  set  out  in  Table  1,  ten  post¬ 
operative  patients  were  randomly  assigned  to  one  of  two  groups.  The 
five  patients  allocated  to  group  one  received  their  first  period  injection 
(a.m.)  while  placed  in  position  A.  In  period  two  (p.m.)  they  were  placed 
in  position  B  and  the  second  medication  was  injected  Into  the  opposite 
dorsogluteal  site.  For  the  next  two  days  medication  was  supplied  by  the 
oral  route.  On  the  final  day  the  period  one  and  two  injections  were 
administered  in  the  reverse  order  (viz.,  B  then  A). 
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TABLE  1 

ur  a  "i-'<'i-r  SCORES  FROM  INTK‘.MllSCiri,AK  INJECTIONS 


Period  1 
(a.n.) 

Period  2  j 
(p.tn.  >  f 

Period  1 
(a  .m.) 

Patient 

Trt.  A 

Trt.  B 

Trt.  B 

1 

1.0 

2.8 

1.9 

2 

1  .0 

1.5 

l./ 

3 

2.0 

4.0 

)  .9 

4 

1.5 

3.0 

1.5 

5 

2.5 

3.0 

3.0 

Trt.  A 


Pat  lent 


Trt.  B 


Trt.  A 


Trt.  A 


In  contrast,  the  five  patients  assigned  to  the  second  group  received 
the  sequence  B  then  A  on  the  first  day.  On  the  final  day  their  sequence 
was  A  then  B.  The  patients  used  the  discomfort  rating  scale  given  in 
Figure  1  to  assign  scores. 


1 


2 


3 


4 


5 


no  moderately  extremely 

discomfort  uncomfortable  uncomfortat 


FIGURE  1 

DISCOMFORT  RATING  SCALE 


b)  Model. 

It  will  be  assumed  that  the  linear  model  upon  which  the  analysis 
would  be  based  is  a  function  6f  the  main  effects  of  groups  (ya), 
periods  (p^),  treatments  or  methods  (t^)  and  their  interactions. 
Specifically, 

E  [w]  •  " +  ti  +  ’k(i) +  °i +  \ *  TTu  * 

p,jl  +  YPTjjj  . 

The  notation  *  .  „  indicates  the  effect  of  patient  k  nested  under 
k(i) 

group  1.  The  "patient"  factor  is  crossed  with  periods  and  treatment, 
but  is  nested  under  groups. 
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c)  Analysis  of  Variance. 


Assuming  that  all  Interactions  with  the  "patient"  factor  are  aero, 
the  analysis  of  variance  summary  is  shown  In  Table  2.  The  mean  square 
for  patients  nested  In  groups  Is  used  to  test  the  group  effect.  The 
period  and  treatment  effects  are  within  patient  effects, 
d)  Unbalanced  Data. 

Suppose  the  patients  were  oniy  administered* each  treatment  one  time. 
Members  of  the  first  group  receiving  only  the  sequence  A-B;  the  second 
group  being  handled  in  the  reverse  order  B-A.  Then  only  four  of  the 
eight  subclasses  in  Table  1  vould  contain  data  (viz.,  the  non-braclteted 
data  cells).  Nevertheless,  a  feature  of  the  de8ign  for  the  remaining 
data  set,  is  that  It  still  yields  an  analysis  of  variance  that  ia  easy 
to  calculate  and  Interpret.  Table  3  allows  one  to  examine  and  compare 
the  analysis  of  variance  proce:>are  for  the  full  (N-4C)  data  set  with 
the  reduced  set  0*"20) .  The  economy  of  effort  is  desirable  when  the 
lnteractlona  are  zero.  If  not,  the  adequacy  of  the  reduced  design  is 
questionable— estimates  of  the  main  effects  will  be  confounded  by 


interaction  terms. 


TABLE  2 


CASE  2 

SUMMARY  OF  AOV  THREE-FACTOR  EXPERIMENT 


Source  of 
Variation 


Between  Patients 


Groups  (G) 

Error  (a) 

(Patients  w.  groups) 


Periods 

GP 

(P) 

Methods 

GM 

(M) 

PM 

GPM 

Error  (b) 

(  P  x  Patients  w.  groups 

df 

MS 

9 

1 

2.704 

8 

2.3102 

30 

1 

0.841 

1 

3.844 

1 

7.056 

1 

2.401 

1 

1 

— 

24 

0.43408 

1.1704 


16.2549 


Table  3  AOV  Scheme  for  Balanced  Case  (N“40)  va  Ml b Bing  Celia  <N*20). 


V— 

Source  of  Variation 

Eil 

1fT« 

Source  of  Variation 

Between  Patients 

9 

9 

Groups 

1 

1 

Groups 

Error  (a) 

[Patients  V.  Croups} 

8 

8 

Error  (a) 

Within  Patients 

30 

10 

Periods 

1 

1 

Periods 

GP 

1 

- 

Methods 

1 

1 

Methods 

CM 

1 

- 

PM 

1 

_ 

GPM 

1 

- 

Error  (b)  _ 

JP  x  Patients  w.  Croups  J 

I  M  x  Patients  v.  Groups  ] 
[PM  x  Patients  w.  Groups 

24 

8 

Error  (b) 

■  ■ 

Allas 

PM 


CM 

GP 


■4 


TOTAL 


TOTAL 


Case  2:  The  Simple  Two-Treatment  Crossover  Experiment 


lo  the  example  we  consider  next,  each  experimental  unit  (patient) 
has  two  periods  of  treatment  supplied.  The  treatments,  say  A,  B,  to 
he  compared  are  randomly  assigned  to  the  two  periods.  Usually,  equal 
numbers  of  units  are  allocated  to  two  groups.  Members  of  the  first 
group  receive  the  treatment  sequence  AB;  the  second  group  is  treated 
in  the  reverse  order  BA. 


FIGURE  2 

NOTATION  AND  LAYOUT 
FOR  THE  SIMPLE  CROSSOVER  EXPERIMENT 


a)  Numerical  Example. 

The  reduced  data  set  (N-20)  of  the  previous  example  fits  this 
layout  of  a  change-over  experiment.  It  will  be  used  to  demonstrate 
an  analysis  that  follows  the  summarization  adopted  by  Lucas  and  Patterson 
(1962). 


b)  Model. 

The  nodal  for  the  observation  is  *ijk 


y  ♦  Pk  +  *  Xr  +  *ij  +  CW 


where 


V 


Pk' 

♦» 

Xr 


eijk 


•  overall  mean 
.  effect  of  the  kth  period 

.  effect  of  the  .th  tre.tu.nt  (.ethod)  »  -  A.  B 

.  effect  of  the  tth  tre.te.nt  d»  the  let  Bttlod  m  the 

response  in  the  2nd  period 
.  the  effect  of  the  1th  euhl.ct  io  the  1th  group 

i  -  1.  2  j  •  1,  2,  *ij 

.  vlthln  eubject  devletlon  for  the  kth  period 
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The  components  of  the  vectors  b%  end  b'j  form  subsets  of  the 
set  of  Greek  letters  used  to  lebel  the  columns  of  the  20  x  13  matrix  Z. 

*  0— 

This  association  is  used  to  determine  the  corresponding  columns  one 
selects  from  Z  to  build  the  equations  Y  "  and  \ 

used  to  perform  the  analysis  presented  in  Tables  4  and  5. 
d)  Analysis  of  Variance. 

The  regression  approach  fo  AOV  offers  a  computationally  convenient 
algorithm  for  generating,  from  the  Y  -  equations,  the  various 

entries  in  the  AOV  table.  The  RO-notation,  employed  in  the  tables, 
serves  to  clearly  describe  the  way  sums  of  squares  were  computed. 

A  complete  summary  of  R()-notation  is  given  in  Searle  (1971). 


TABLE  4 
AOV  FCR  CAS 
[simple  two-treatment  ch 


Source  of 

Variation 

t 

df 

/ 

1 

SS 

Between  Units 

9 

10.188 

Groups 

1 

1.352 

Units  v.  Groups 

8 

8.836 

Within  Units 

10 

Periods 

1 

0.200 

Methods 

1 

10.658 

Error  (b) 

8 

3.292 

Total 

19 

24.338 

RQ-Notatlon 


Since  direct  and  residual  treatment  effects  are  not  orthogonal, 

tvo  partitions  of  the  total  sum  of  squares  for  treatments  are  given  in 

Table  5.  Only  the  first  of  these  is  required  if  residual  effects  are  of 

no  Interest  apart  from  their  use  in  adjusting  direct  effects.  Both  will 

be  required  when  residual  effects  are  tested.  F-tests  for  direct  and 

2  5 

residual  effects  can  be  carried  out  by  dividing  s  d  and  *  r>  respectively 
by  s  e.  A  summary  of. the  experiment  is  presented  in  Table  6.  The  first 
part  of  this  table  gives  the  summary  required  when  residual  effects  are 
present.  The  second  part  presents  the  summary  required  when  the  analysis 
shows  residual  ef Teres  are  negligible  (i.e.,  estimates  of  direct  effects 
unadjusted  for  residual  effects,  but  adjusted  for  differences  between 
units,  are  give') 


TABLE  6 

SUMMARY  OF  RESULTS  CASE  2  DATA  SET 


Residual  Effects  Present 


(jj  +  +  #) 

Group  1  Period  1 
Trt.  A 


Group  2  Period  1 
Trt.  B 


.A.  A  A 

(p  +  Px  -  4> ) 


,A  * 

_(u  -  Pi  -  ♦  +  *) 


Group  1  Period  2 


Trt.  B< 


1.6 

2 

vO 

00 

_ 1 

na5 

n«5 


.497  -  Se 


\  3/5 


means 


1.92 


n"5 


Group  2  Period  2 
Trt.  A, 


B 


A  A  A  ^ 

(u  -  +  ♦  ~  X) 


Residual  Effects  Negligible 

Trt.  A 

.A  A  /  . 

(p  +  4»  ) 

1.76  - 

n*10 


'  .203 


Trt.  B, 

(P  -  %  ) 

means - 1  3.22 

n®10 

Se  aJi/10* 


36 


Experiment 


The  two  period  crossover  design  for  clinical  trials  was  compared 
with  other  simple  designs  in  terms  of  statistical  precision  and  cost 
fey  Brown  (Biometrics,  March  1980). ■  In  Brown's  article  an  alternative 
analysis  of  a  simple  crossover  study  was  illustrated.  The  data  from 
this  example  are  presented  in  Table  7.  The  response  of  interest  is 
an  oral  hygiene  Index  used  to  compare  a  test  compound  with  e  placebo 
with  regard  to  effect  on  dental  hygiene.  Summary  statistics  and 
tests  of  significance  for  the  data  are  given  in  Table  8.  Results  of 
estimating  the  treatment  effect  on  the  basis  of  first -period  data 
alone  are  reproduced  in  Table  9.  For  comparative  purposes,  the  summary 
results  from  the  hygiene  data  are  presented  using  the  "Lucas  Format"  in 
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Table  10. 


Subject 


_ Grout 

Period  1 
Placebo 


Period  2 
Test 


_ Group 

Period  1 
Test 


II 

Period  2 
Placebo 


TABLE  8 

SUMMARY  STATISTICS  AND  TESTS  OF  SIGNIFICANCE 
FOR  THE  HYGIENE  DATA 


Analysis  of  Differences 


Mean 

Variance 

Number 


Group  I 
0.5985 
1.3268 
34 


Group  II 
-0.9440 
0.5173 
30 


Pooled  Var. 

Est.  Trt.  Effect 
St.  "Error 

t  6.32 

df  -  62 

p  <  .001 


0.9482 

0.7712 

0.1220 


-  TABLE  9 

Summary  Statistics  and  Tests  of  Significance 
of  the  First-Period  Data  from  the  Dental  Hygiene  Study 


Group  1  Croup  II 


Mean 

0.7597  1.3663 

Variance 

0.6  0.3845 

Humber 

34  30 

Pooled  Var 

0.4992 

Trt*  Effect 

0.6066 

St.  Error 

0.1770 

t 

3.4271 

df 

62 

P 

0.001 

Basically,  Brown's  article  is  critical  of  crossover  studies. 

The  analysis  of  differences,  he  points  out,  relies  on  the  assumption 
that  a  therapy  has  a  certain  additive  effect  that  does  not  depend 
upon  the  time  period  the  treatments  were  administered.  That  is, 
response  to  a  tr^tment  should  not  be  Influenced  by  whether  or  not 
the  other  treatment  was  just  given.  For  the  hygiene  data,  the 
analysis  of  sums  suggests  the  assumption  is  questionable;  thus,  the 
possibility  of  a  large  direct  treatment  effect  bias  exists  due  to 
residual  effects.  When  the  assumptions  are  contradicted.  Brown 
recommends  one  only  utilize  first  period  data.  Although  his  data  set 
was  large  enough  to  evaluate  treatment  effects  in  this  way,  he  reminds 
us  that  this  Is  the  exception,  not  the  rule,  for  change-over  designs. 
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TABLE  10 

PARTITION  OF  TREATMENT  SUM  OF  SQUARES:  HYGIENE  DATA  SET 


Source  of 
Variation 


Between  Subjects 
Groups 

Subj .  w.  Groups 

tflthin  Subjects 
Periods 
Treatments 
Error  (b) 


Total 


Direct  Effects 
(eliminating  residual) 

Residual  Effects 
(Ignoring  direct  effects) 


Treatments  -  td  and  TR 
(direct  and  residual) 

or 

Direct  Effects 
(Ignoring  residual) 

Residual  Effects 
(eliminating  direct  effects) 


df 


63 

1 

62 


1 

1 

62 


127 


SS 


18.5993 

0.86264 

17.7367 


0.5 

18.95456 

29.40679 


67.46069 


5.86495 

13.95225 


19.8172 

18.95456 

0.86264 


R()-Notatlon 


R(p|u) 

R(TD|y,  p) 


Y'Y  -  R(y) 


R(TD  |y,  p,  TR) 

r(tr|w,  p) 


R(Td|w,  p) 
R(Tr|w,  p,  TD) 


F 


3.015  1.736 


39.963 


12.36 


6.32 


3.52 


Treatments 

(direct  and  residual) 


2 


19.8172 


Summary  -  Brovn  vs.  Lucas 

Brown' s  format  for  crossover  nnalysis  was  applied  to  the  injection 
data  and  the  results  are  presented  in  Tables  11  and  12.  Summary 
results  from  the  injection  data  are  presented  using  the  Lucas  format 
in  Table  13.  Brown  bases  his  test  for  a  residual  effect  on  the 
"analysis  of  sums".  Lucas  would  carry  out  the  F-test  for  residual 
effects  by  comparing  s2f  with  s2e-  The  tests  differ.  Brown  chooses 
between  group  variability  for  an  error  term,  while  Lucas  chose 
vithin-subject  variability.  Further  differences,  as  well  as 
similarities,  between  the  other  tests  of  significance  utilized  by 
these  two  approaches  are  set  out  in  Table  14. 
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TABLE  11 

SUMMARY  STATISTICS  AND  TEST  OF  SIGNIFICANCE 


Mean 

Variance 

Number 

Pooled  Variance 
Esc.  Trt.  Effect 
St.  error 
t 
df 
P 


Mean 

Variance 

Number 

Pooled  Variance 
Est.  trt.  effect 
St.  error 
t 


Analysis  of  Differences 


Group  I 

-1.26 

0.513 


Group  II 

1.66 

1.133 


0.823 

-2.92 

0.5737 

-5.089 

8 

0.00094 


Analyala  of  Sums 

Group  I 

4.46 

1.933 

5 


Croup  II 

5.50 

2.485 

5 


2.209 

-1.04 

0.94 

-1.1064 


TABLE  12 

SUMMARY  STATISTICS  AND  TESTS  OF  SIGNIFICANCE 


Analysis  of  Period  One  Scores 


Mean 

Var 

n 


Group  I 

1.6 

.425 

5 


Pooled  Var 
Est.  trt.  effect 
St.  error 
c 
df 
P 


.7085 

-1.98 

0.5323533 

-3.719 

8 

.006 


Group  II 

3.58 

.992 

5 


TABLE  13 

PARTITION  OF  TREATMENT 
SUM  OF  SQUARES 


Source  of 


Variation 

df 

SS 

MS 

Between  Units 

9 

10.188 

Groups 

1 

1.352 

Units  v.  Groups 

8 

8.836 

Within  Units 

10 

P 

1 

0.20 

Treatments 

1 

10.658 

10.658 

Error 

8 

3.292 

0.4115 

Total 


19 


df 

SS 

R() -Notation 

Direct  Effects 
(eliminating  residual) 

1 

9.801 

R($|u,  ",  P,  *) 

*2 
s  d 

Residual  Effects 
(ignoring  direct  effects) 

1 

2.209 

R(X  j  v,  ",  P) 

Treatments 

(direct  and  residual) 

2 

12.01 

R(P,  l|y,  ",  p) 

OR 

Direct  Effects 
(ignoring  residuals) 

1 

10.658 

R(4»|y,  ",  p) 

‘V 

Residual  Effects 
(eliminating  direct) 

1 

1.352 

R(X|y,  n,  p,  4) 

$2r 

2  12.01 
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TABLE  14 

A  SUMMARY  RELATING  TESTS  OF 
SIGNIFICANCE  FOR  TWO  APPROACHES  TO 
CROSSOVER  ANALYSIS 


1.  Analysis  of  Differences 


(Brown) 

(Lucas) 

t2  •(  -2.92^  2  - 

(-5.089)2  -  25.9  -  10.658  -  S*2d 

\  .5737y 

.4115  ^2 

Analysis  of  Sums 

t2  -/'-1.04V  -  (■ 

-1.1064)2  -  1.224  -  1.352  -  S2r 

(8.836/8)  MS  units 

NOTE:  3.286  -  1.352  -  S2r 

.4115  s2 
e 


3.  Analysis  of  First  Period  Scores 
(residual  effects  present) 


t2 


( 


-1.98 

£(.  7085X4/10) 


(-3.719)2  -  13.83  -  9.801  -  S^d 
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A.  TIME  SERIES  ANALYSIS  AND  MODELING  APPROACH  OF 
SENSE  AND  DESTROY  ARMOR  (SADARM)  RADIOMETRIC 
”  (ELECTROMAGNETIC  RADIATION)  NOISE  DATA 


Richard  T.  Maruyama 
Probability  and  Statistics  Branch 
Ballistic  Modeling  Division 
U.S.  Army  Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,  Maryland 


ABSTRACT.  A  series  of  tests  were  conducted  in  August  1978  to  collect 
radiometric  (electromagnetic  radiation)  data  at  the  North  East  Test  Site  of 
the  Rome  Air  Development  Center  in  Rowe,  N.Y.  This  radiometric  data  was 
collected  using  the  Sense  and  Destroy  Armor  (SADARM)  target  sensoring 
system.  Both  background  (grounT  noise)  and  target  (tanks)  data  were 
collected  to  investigate  the  signal  characteristics  of  the  SADARM  weapon 
system.  The  data  was  recorded  at  equally  spaced  time  intervals  over 
five  ranges.  The  objectives  of  the  SADARM  sensor  are  to  detect  and  then 
8im  the  antiarmor  munition  at  the  target. 

This  paper  presents  the  Box  and  Jenkins  time  series  modeling  effort  on 
the  background  radiometric  data.  This  effort  resulted  in  an  Autoregress ive- 
Moving  Average  (ARMA)  model  of  order  p*l  and  q«l,  where  the  autoregressive 
parameter  ranged  from  0.73  to  0.88  and  the  moving  average  parameter 
ranged  from  -0.59  to  -0.64.  This  ARMA(1,1)  model  seems  adequate  for 
characterizing  the  background  noise  of  the  SADARM  weapon  system. 

1.  INTRODUCTION.  The  Sense  and  Destroy  Armor  (SADARM)  is  a  "Fire 
and  Forget"  antiarmor  munition  being  developed  and  tested  by  the  US  Arm y 
Armament  Research  and  Development  Command  (ARRADCOM) .  The  lead  laboratory 
for  the  program  is  ARRADCOM* s  Large  Caliber  Weapon  System  Laboratory 
(LCWSL).  The  systems  analysis  and  sensor  technology  research  is  ongoing 
at  the  Ballistic  Research  Laboratory  (BRL) ,  Aberdeen  Proving  Ground,  MD. 

As  part  of  this  BRL  effort  the  SADARM  sensor  was  tested  in  August  1978 
at  the  North  East  Test  Site  of  the  Rome  Air  Development  Center  in  Rome, 

NY.  This  electromagnetic  radiation  data  collected  in  Rome,  NY,  is 
essentially  the  absolute  temperature  of  the  ground  surface  at  the  sensor's 
focus  point,  and  is  referred  to  as  radiometric  readings.  The  functional 
relationship  between  absolute  temperature  and  the  radiometric  reading  is 
given  in  equation  (1) : 

n  *  2kT/A2  (1) 

where  k  is  Boltzmann's  constant 

T  is  the  absolute  temperature 
and  X  is  the  wavelength. 


371 


The  radiometric  data  was  collected  over  field  and  tree  terrains,  at 
five  different  distances  that  ranged  from  SO  to  ISO  meters  (see  Figure  1) . 
The  SAD ARM  sensor  was  mounted  on  a  helicopter  at  a  slant  angle  of  30* 
measured  from  perpendicular  to  the  ground.  The  helicopter  was  then 
flown  at  a  ground  speed  of  60  knots.  The  SADARM  sensor  recorded  approxi¬ 
mately  two  thousand  equally  spaced  observations  (data)  per  second  and 
was  instrumented  to  make  four  (4)  revolutions  per  second  (RPS) . 

The  object  of  the  sensor  is  to  detect  the  target  and  then  aim  the 
weapon  system  under  battlefield  conditions.  Hence,  the  response  of  the 
SADARM  sensor  to  varying  background  conditions  will  effect  the  weapon 
system's  ability  to  detect  targets. 

Presented  in  this  report  is  the  Box  and  Jenkins  modeling  approach 
for  the  radiometric  non-target  observations.  The  analysis  demonstrates  the 
ability  of  the  ARIMA  model  to  characterize  the  SADARM  data. 


2.  THE  BOX  AND  JENKINS  TIME  SERIES  yPROACH.  The  Box  and  Jenkins1 
Autoregressive  Integrated  Moving  Average  (ARIMA  (p,d,q))  model  is  used  to 
characterize  many  types  of"T>usiness,  economics  and  engineering  observations. 
The  need  to  develop  a  model  of  the  SADARM  sensors'  responses  to  backgrounds 
(terrain)  has  been  ongoing  at  BRL.  In  order  to  satisfy  this  requirement 
the  Box  and  Jenkins  approach  was  initiated. 

A  representative  plot  of  the  August  1978  SADARM  data  is  presented  in 
Figure  2. 

The  ARIMA  (p,d,q)  model  is  presented  below: 

4>p(B)(l-B)d(Zt-w)  .  Bq(B)at  (2) 

where  B  is  the  backshift  operator  such  that  BZt  ■  Zt_1# 


<t»p(B)  is 


a  polynomial  in  B  of  order  p  and  ^  are  autoregressive  parameters. 


0(B)  is 


a  polynomial  in  B  of  order  q 


and  0^  are  moving  average  parameters. 


p,d,q  are  non-negative  integers,  and 


a  are  random  shocks  (white  noise)  assumed  to  be  independently 

z  2 
distributed  normal  variates,  N(0,o_). 

A  series  of  five  hundred  observations  were  analyzed  and  used  to 
estimate  the  Autocorrelation  Function  (ACP)  and  Partial  Autocorrelation 
Function  (PACF)  out  to  40  lags  (see  Figure  3  and  Table  1) . 
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TERRAIN  (field,  trek) 

NORTH  EAST  TEST  SITE  OF  THE  ROIC  MR  DEVELOPMENT  CENTER  IN  IMC, 
AUGUST  1971 

FIGURE  1 


SADARrt  DATA  (NO  TARGET) 


In  building  a  dynamic  time  series  model  three  steps  are  required; 
(1)  tentatively  identify  the  model,  (2)  fit  the  data  to  the  model,  and 
(3)  perform  a  diagnostic  check  for  lack  of  fit.  The  identification  step 
requires  an  overall  view  of  the  data  structure.  In  this  case  a  damping 
sinusoidal  structure  for  the  ACF  and  two  significant  spikes  at  the  first 
and  second  PACF  are  displayed  (see  Figure  3) .  The  estimated  mean  (u) 
and  standard  deviation  (o)  of  this  time  series  are  -1.603  and  3.712, 
respectively. 

This  identification  step  implies  that  an  autoregressive  model  of  order 
p«2  be  tentatively  entertained.  Hence,  the  ARIMA  (2,0,0)  model  was  fitted. 


ARIMA  (2,0,0)  Tentatively  Entertained 

(1  -  OjB  -  *2B2)(Zt  -  y)  »  at  (3) 


where  the  estimated  parameters  are  y  ■  -1.636 

-  1.2S2 

A 

$2  «  -0.450 


The  autocorrelation  function  of  the  residual,  a^  ■  -  Z ,  of  the 

ARIMA  (2,0,0)  model  was  then  looked  at  for  lack  of  fit.  These  residual 
ACF  are  listed  in  Table  II,  where  the  estimated  residual  mean  and  standard 

A  A 

deviation  are  yQ  *  0.00175  and  a  *  1.681.  The  ACF  of  the  residuals  at 
at  at 

lags  k=l  and  k=2  indicate  some  remaining  residual  structure.  Also,  the 
cutoff  of  the  residual  ACF  demonstrates  a  possible  need  for  a  moving 
average  term  in  the  model. 

Hence,  the  ARIMA  (2,0,1)  model  was  entertained  to  remove  the  spikes 
in  the  residual  ACF.  The  ARIMA  (2,0,1)  model  is  as  follows: 


.ARIMA  (2,0,1) 

(1  -  4>jB  -  $2B2)(Zt  -  y)  *  (1  -  01B)at  (4) 

A 

where  the  estimated  parameters  are  y  =  -1.622 

A 

=  0.886 
A 

<t>2  =  -0.138 

A 

0j  -  -0.506  . 
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The  addition  of  the  moving  average  term,  did  remove  the  autocorrelation 
spikes  for  lags  k  ■  1  and  k  ■  2  (see  Table  III) .  Inspection  of  the  residuals 

with  mean  y  *  0.0009  and  standard  deviation  o  *  1.616  indicated  a  good  fit 
at  t 

(see  Figure  5) .  The  9£%  confidence  interval  for  the  second  estimated  auto* 
regressive  parameter,  <t>2  *  -0.138,  overlapped  the  zero  point  (-0.285,  0.0086). 

This  suggested  the  possible  removal  of  this  term  from  the  model.  Rased  upon 
this  information  and  the  principle  of  parsimony  in  the  use  of  parameters, 
the  second  autoregressive  parameter  was  removed  and  the  ARIMA  (1,0,1)  model 
was  considered. 

The  ARIMA  (1,0,1)  model  is  as  follows: 

ARIMA  (1,0,1) 

(1  -  ♦1B)(zt  -  V)  -  (1  -  81B)at  (5) 

A 

where  the  estimated  parameters  are  y  <*  -1.618 

<j>j  =  0.7553 

-  -0.5960  . 

Both  the  ACF  and  PACF  of  the  ARIMA  (1,0,1)  residuals  indicated  a  lack  of 
any  remaining  structure.  Table  IV  and  Figure  6  show  that  the  residual 

(at  »  zt  -  zt)  no  longer  contains  any  structure.  Further  data  analysis  to 

better  characterize  this  time  series  was  unsuccessful. 

A  summary  of  the  models  and  their  estimated  parameters  are  presented 
in  Table  V. 
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3.  APPLICATION  Or  ARIMA  ‘(1,0,1)  MODEL  TO  THE  REMAINING  SADARM  DATA. 

In  Section  2  the  rationale  for  the  selection  of  the  AR1MA  (1,0,1)  mode) 
was  presented.  In  this  section  the  remaining  data  for  the  non-target  cases 
are  analyzed.  The  ARIMA  (1,0,1)  model  was  found  to  be  adequate  for  modeling 
the  non-target  SADARM  data  collected  in  Rome,  N.Y. 

The  first  step  is  to  investigate  more  of  this  data.  In  Table  VI,  both 
the  ACF  Bnd  PACF  of  additional  samples  of  S00  observations  at  different 
distances  (30,  60,  90,  120  and  150  meters)  were  estimated.  Plots  of 
both  are  presented  in  Figure  7.  The  similarity  in  the  correlograas 
indicate  the  possibility  of  modeling  all  the  SADARM  non-target  data  with 
the  same  ARIMA  model.  Hence,  Table  Vll  was  generated  from  other  sets  of 
SADARM  data  using  the  ARIMA  (1,0,1)  model. 

This  analysis  suggested  that  the  means  (y)  are  varying,  but  that 
the  autoregressive  parameter  ($.)  and  the  moving  average  parameter  (0.) 

are  not.  The  estimated  parameters  (y  ,  o  )  for  white  noise  (random  shocks) 

are  consistent  for  those  cases  investigated.  That  is  y  *  0.0  and 

a  A 

0#  *  (1.62  to  1.79).  A  closer  look  at  the  residuals,  a^,  indicates 

a  lack  of  any  consistent  pattern  after  fitting  the  ARIMA  (1,0,1)  model. 

Figure  8  was  constructed  to  demonstrate  the  lack  of  structure  in  the  residuals, 
indicating  the  ability  of  the  ARIMA  (1,0,1)  model  to  characterize  all  the 
SADARM  non-target  data.  This  ability  to  model  the  different  cases  by  a  white 
noise  model  is  used  to  simulate  the  sensor's  characteristics.  The  ARIMA 
(•)  model  used  for  this  purpose  is  that  of  Equation  (6). 

A  A  A  M 

zt  *  Cl  -  4>)y  *  at  -  0at_2  ♦  4>*t_j  (6) 

''2 

where  a  =  N(0,  o  )  and 

t.  & 

AAA 

(y,4>,6)  are  the  estimated  parameters. 

A  plot  of  one  such  simulated  case  is  shown  in  Figure  9  as  a  comparison  to  the 
actual  data  plotted  in  Figure  2. 


4.  SUMMARY .  The  SADARM  data  collected  in  August  1978  was  analyzed 
using  the  Box  and  Jenkins  Time  Series  approach.  This  approach  indicated 
an  ARIMA  (1,0,1)  model.  This  particular  ARIMA  model  characterizes  the 
data  remarkably  well.  What  is  more  remarkable  is  the  consistent  behavior 

A  A 

of  both  the  estimated  parameters  (4^,  6j)  and  the  white  noise  parameters 
(*t.  o#).  The  indications  are  that  the  ARIMA  (1,0,1)  structure  is  adequate 
for  describing  this  set  of  SADARM  data. 
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TO  a!  ROLE  0?  SPATIAL  BANDWIDTH  LIMPS  IN  THE  MEASUREMENT  AND  INTERPRETATION 
0?  SECOND -ORDER  STATISTICAL  PROPERTIES 


S.  L.  Church 
Technology  Branch 
Armaments  Division 

Flro-Control  and  Small  Caliber  Weapon  Systems  Laboratory 

US  ARRADCOM 

Dover,  New  Jersey  07801 


ABSTRACT.  Many  important  physical  processes  depend  on  the  second-order 
statistics  of  a  random  variable}  that  is,  its  autocovariance  function  or  power 
spectral  density.  The  measurement  and  specification  of  surface  topographic 
finish  is  a  case  of  particular  interest  to  the  Amy.  However,  the  spectra  of 
the  profiles  of  manufactured  surfaces  are  frequently  of  the  power-law  form 
W  -  p”8,  where  p  is  the  3patial  frequency  and  s  is  a  number  '■vw  1,  2.  Con¬ 
sequently,  the  classical  spectral  moments  —  corresponding  to  the  surface 
height,  slope  and  curvature  variances  —  diverge  when  these  spectral  densities 
are  extrapolated  to  very  low  and  very  high  spatial  frequencies,  and  one  is 
forced  to  give  up  the  idea  of  intrinsic  finish  parameters  and  to  deal  with 
bandwidth-limited  values  instead.  There  is  therefore  a  critical  need  to  under¬ 
stand  the  role  of  spatial  bandwidth  limits  in  the  measurement  and  specification 
process.  The  present  paper  addresses  this  need)  It  discusses  the  role  of 
second-order  statistical  functions  and  moments  in  the  surface-characterization 
problem;  the  effects  of  bandwidth  limits  on  the  magnitudes  of  the  spectral 
moments  and  the  relationship  between  profile  and  area  moments;  and  concludes 
with  a  discussion  of  the  origin  and  magnitudes  of  the  bandwidth  limits  in  a 
number  of  generic  measurement  situations. 

1.0  INTRODUCTION. 

1.1  RELEVANCE  AND  PROBLEM  STATEMENT.  Surface-finish  measurement,  charac- 
terlzation  and  specification  are  an  important  problem  for  the  military  and  in¬ 
dustry.  This  new  interest  in  an  old  area  is  due  to  a  number  of  factors:  The 
development  of  new  manufacturing  techniques  for  mechanical,  electronic  and  opti¬ 
cal  surfaces;  increasing  emphasis  on  standardization,  interchangeability  and 
reliability;  cost  savings  inherent  in  realistic  as  opposed  to  over-specification; 
and  concerns  for  energy  and  material  conservation. 

The  present  standards  for  surface  finish  are  stated  in  terms  of  the  average 
deviation  of  the  surface  profile  from  its  mean.  This  is  inadequate  because  it  is 
insensitive  to  the  transverse  character  of  the  roughness  and  i3  only  indirectly 
related  to  the  surface  area,  which  determines  the  functional  properties  of  sur¬ 
faces  . 

The  statistical  basis  for  a  more  comprehensive  description  of  topography  was 
developed  many  years  ago  by  Longuet-Hlggens  in  a  famous  series  of  papers  concern¬ 
ing  ocean  waves, and  has  been  adapted  more  recently  to  mechanical  surfaces. ^ 

Unfortunately,  however,  this  classical  approach  does  not  take  account  of  two 
related  real-world  situations:  First,  that  measurement  techniques  and  functional 
properties  of  surfaces  are  sensitive  to  only  limited  ranges  of  surface  spatial 
wavelengths;  and  second,  that  the  spectral  densities  of  real  surfaces  are  such 
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that  they  do  not  '’fit”  into  those  wavelength  windows.  In  fact,  the  spectra  of 
Stapes /are  frequently  of  a  form  which  diverges  when  extrapolated  to  vepr  long 
or^vary^short  spatial  wavelengths .  The  signature  of  the  classical  description 
?!  th7t  it  is  expressed  in  terms  of  intrinsic  surface  properties,  while  in  fact, 
“e  Jin  only  speak°in  terns  of  bandwidth-limited  quantities  -  the  observed  and 
the  observation  can  no  longer  be  separated. 

We  address  three  issues  in  this  paper.  1)  The  classical  of 

the  characterization  problem  to  include  second-order 

2)  The  effects  of  bandwidth  limits  on  the  magnitudes  of  the  profile-  and  surface 
finish  parameters  and  the  relationship  between  them.  And  3),  the  iouish  of 
bandwidth  limits  in  various  measurement  situations.  For  simplicity 
discussion  to  surfaces  whose  roughness  is  purely  random,  isotropic,  and  weakly 
stationary  and  weakly  ergodic. 

2.0  CLASSICAL  DESCRIPTION. 

2.1  FIRST-ORDER  STATISTICS.  The  most  general  probabilistic  d®®cfiP^ia}, of 
a  random  vaflaKle,  1,  is  giv«n~In  terms  of  the  N-th  order  Joint  probability  dis¬ 
tribution 


p^v-’V 


(1) 


Where  -  Z(x.  .y<  )  is  the  surface  height  at  the  point  i.  The  location  and  num¬ 
ber  of  points/1/  is  chosen  to  provide  adequate  characterization  for  the  purpose 

at  hand. 

The  simplest  case  is  N  -  1;  the  first-order  height  distribution  function 

P(Z1)  -  Jdz2  •••  f«np<Zl,*’,,zn)  *  (2) 


The  simplest  example  of  this  is,  of  course,  the  Oaussian 

P(Z1)  -  (1/  t/2iio)exp(-Z^/2a2) 


(3) 


where  cr2  is  the  height  variance;  that  is,  o  is  the  standard  deviation  or 
rma  value  of  the  surface  height. 

In  practice  the  firat-order  height  distribution  function  is  characterized  by 
a  set  of  finish  parameters  —  the  central  momenta. 


M 


dZP(Z) I Z 1 


U) 


where  n  •  1.  2,  3,  b  correspond  to  the  average  surface  height,  the  height  vari¬ 
ance,  its  skew  and  kurtosis,  respectively.  The  present  US  and  international 
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atandards  for  surface  texture  are  stated  only  in  terms  of  the  average  height: 


R 

a 


where  *  * s 


r+L 

M  -  <  |Z|  >  -  tiro  (1/2L) I  dx)2(x)| 

L-*-  -f-L 


(5) 


14 

denotes  the  ensemble  average  and  2L  is  the  sample  record  length* 


2.2  SECOND-ORDER  STATISTICS.  The  description  of  Z  in  terms  of  first-order 
statistics  alone  is  clearly  limited:  it  conveys  no  information  about  the  trans¬ 
verse  character  of  the  surface  roughness.  For  example,  the  two  surface  profiles 


(6) 


have  the  sane  first-order  statistics  but  very  different  functional  properties. 
This  forces  one  to  consider  the  inclusion  of  higher-order  distributions  in  the 
characterization  process. 

The  next  step  in  sophistication  involves  the  second-order  Joint  probability 
distribution  function 


P(ZltZ2)  -  j  d23“-|dZnP(Z1.--*,Zn)  . 

The  simplest  example  of  this  is  again  the  well-known  Gaussian  result 


(7) 


P(ZrZ2)  -  (l/2a/o"!*-C2)exp[-  (o2Z|  -  ZCZ^  +  o2Z2)/(2(o-  -  C2)J  . 


(8) 


Here  C  is  the  height  autocovariance  function, which  may  be  viewed  as  the  simplest 
Joint  moment  of  the  second-order  probability  distribution: 


C(xlfx2)  -  Mn  -  dzj  dZ2P(Z1,Z2)zJz2  . 


(9) 


In  contrast  with  the  first-order  case  such  moments  are  functions  rather  than  num¬ 
bers  j  in  this  case  e  function  of  the  two  observation  points,  57  and  x9.  At  the 


tana  x? . 
her,  ir 


the  sur- 


particular  point  »  X2,  C  ■  o  ,  the  height  variance.  Fur 
face  height  is  a  weakly  stationary  random  variable,  the  autocovariance  is  only  a 
function  of  the  magnitude  of  the  separation  of  the  two  observation  points:  the 
lag  t  -  |  Xl  -  x2|  . 

2.3  SECOND-ORDER  FUNCTIONS.  The  height  autocovariance  function  end  its 
Fourier  transforms  form  e  family  of  functions  that  are  used  to  describe  the  second- 
order  statistical  properties  of  a  random  variable.  If  the  height  Z  ie  weakly  sta¬ 
tionary  and  weakly  srgodic  the  covariance  function  can  be  written  in  the  equivalent 
forms 


C(t)  - 


<  ziY 


r  2L-t 

-  Um  (1/2L)  dxZ(x)Z(x  +  t) 
L-*®  '  0 


(10) 
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The  one-dimensional  power  spectral  density  —  which  appears  in  the  discussion  of 
surface  profile  data  —  is  the  cosine  transform  of  the  autocovarlance  function) 


Wx(p)  -  (1/ioJo  dTCOs(pr)C(T)  - 


tin  (l/2it)<  (1/2DIP  dxexpdpx)Z(x)  |2  > 
L-h»  J-L 


(11s) 


(lib) 


where  the  parameter  p  is  the  surface  spatial  wavenumber;  that  is*  2it  times 
the  spatial  frequency.  The  two-dimensional  power  spectral  density  —  which 
appears  in  the  discussion  of  isotropically  rough  surfaces  —  is  its  zeroth-order 
Henkel  transform: 


W2(p)  -  (l/2v)|  TdtJ0(pT)C(T)  - 

-  Urn  <  (1/itR2) l j  xdxJ0Cpx)Z(x))2  > 


(12a) 


(12b) 


where  Jq  is  the  ordinary  Bessel  function  of  zeroth  order. 

When  Z  is  weakly  stationary  —  that  is,  the  power  spectra  are  derived  from  a 
cannon  autocovariance  function  according  to  Eqs.  (11a)  and  (L2a)  —  the  one-  and 
two-dimensional  power  spectra  are  related  to  each  other  through  the  Abel  transforms 


wx(p) 


if 

' p  i42  -  p2 


(13a) 


w2(p) 


/t2"  -  p2 


(13b) 


These  integral  transforms  are  related  to  half -integral  and  half-derivative  opera¬ 
tions,  respectively.  Convenient  tables  of  Cosine,  Hankel  and  Abel  transforms  are 
given  in  the  Bateman  collection. 3 

Figure  1  ia  a  sketch  of  the  various  interrelationships  among  tha  various 
quantiiies  discussed  above.  A  particular  trio  of  functions  which  are  relevant 
to  the  following  discussions  is  given  in  the  Appendix. 

2.1i  SECOND-ORDER  FINISH  PARAMETERS.  The  second-order  statistical  functions 
discussed  above  may  themselves  be  characterized  by  a  set  of  finish  parameters 
corresponding  to  the  central  moments  of  the  power  spectral  densities.  Specifically, 
the  even  n-th  order  profile  moments: 


,n/2 

2 - -  12 
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These  moments  have  direct  physical  meaning  as  indicated  on  the  right:  «q  is  the 
variance  of  the  profile  height,  mg  is  the  variance  of  the  profile  slope,  and  mi, 
is  the  variance  of  the  profile  "curvature".  A  similar  set  of  momenta  can  be 
defined  for  the  two-dimensional  spectral  density t 


TJ  Pdpw2 


|v^/2)Z(r)|2 


As  indicated  on  the  right,  M q  is  the  variance  of  the  surface  height,  Mg  is  the 
variance  of  the  surface  gradient,  and  Ml,  is  the  variance  of  the  Oauasian  curva¬ 
ture  of  the  surface.  The  imediate  value  of  these  moments  in  determining  the 
functional  properties  of  surface  roughness  is  obvious. 

The  relationship  between  the  profile  and  area  moments  is  of  critical  impor¬ 
tance  in  the  measurement  and  specification  process  since  the  traditional  method 
of  measuring  surface  finish  is  by  means  of  a  stylus  gauge  —  which  measures  the 
surface  profile  —  while  the  functional  properties  of  surfaces  clearly  depend  on 
their  surface  properties. 

If  the  surface  roughness  is  weakly  stationary,  the  two  power  spectra  are 
related  —  by  £q.  (13)  —  and  therefore,  so  are  their  moments.  In  particular. 


r({l/2}  fn  4-  ID 
r(l/2)r({l/2}[n  +2]) 


where  I*  is  the  gamma  function.  This  result  says,  for  example,  that  mn  -  Mq, 
which  means  that  the  profile  roughness  equals  the  area  roughness)  mg  ■  Mg, 
which  means  that  the  variance  of  the  profile  slope  is  half  tnat  of  the  surface 
gradient;  and  raj,  ■  3/8  Ml  which  means  that  the  variance  of  the  profile  curvature 
is  three-eighths  of  the  Gaussian  curvature  of  the  surface.  All  this,  of  course, 
for  an  isotropically  rough  surface.  Straightforward  generalizations  to  aniso¬ 
tropic  surfaces  have  also  been  given. 

2.5  CLASSICAL  SOLUTION.  We  call  the  above  the  classical  solution  to  the 
characterization  problem,  ft  has  a  number  of  attractive  features:  1)  It  repre¬ 
sents  the  next  step  in  the  systematic  generalization  of  the  present  finish  stand¬ 
ards  which  includes  the  transverse  as  well  as  the  vertical  character  of  the 
roughness.  2)  It  is  stated  in  terms  of  a  set  of  finish  parameters  which  have 
direct  physical  meaning:  the  rms  values  of  the  surface  height  and  its  deriva¬ 
tives.  3)  It  provides  a  direct  relationship  between  profile  measurements  — 
which  are  easier  to  make  —  and  the  properties  of  the  surface  area  —  which 
determine  its  functional  properties.  And  U),  the  scheme  on  which  it  is  based  is 
readily  generalizable  to  include  higher-order  statistical  properties  —  if  and 
when  required.  In  fact,  if  the  roughness  can  be  taken  as  a  full  Gaussian  process, 
the  probabilistic  description  tp  arbitrary  order  may  be  described  solely  in  terms 
of  its  second-order  properties*^  in  that  case,  a  wealth  of  results  eonceming 
the  surface  can  be  expressed  in  terms  of  the  first  three  profile  moments,  mo*  mg 
and  mj,  discussed  above.  Examples  are  given  by  Longuet-Higgens  and  others. 
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3.0  RKAL  SHUKACdS. 


3.1  MECHANICAL  SURFACES .  The  traditional  method  of  measuring  surface  rough¬ 
ness  uses  a  stylus  gauge  to  determine  the  surface  profile  2(x).  This  is  usually 
used  for  "mechanical"  surfaces;  that  is,  surfaces  generated  by  machining,  grinding 
or  lapping  processes,  which  generally  lead  to  surfaces  with  vertical  roughness  of 
>  1  microinch  (25  nanometers). 

Such  surfaces  very  frequently  exhibit  power  spectral  densities  of  the  form 


wx(p)  “  P~2 


(1?) 


where  p  *  2"  /d  is  the  surface  spatial  wavenumber  and  d  is  the  surface  spatial 
wavelength. 5  This  result  says  that  the  profile  power  spectral  density  is  propor¬ 
tional  to  the  square  of  the  spatial  wavelength,  or  equivalently,  that  the  rough¬ 
ness  is  scale  invariant t  it  is  statistically  the  same  at  all  magnifications 

The  ubiquity  of  this  "inverse -square"  law  is  amazing,  and  a  number  of  models 
have  been  invoked  to  explain  it.  One  is  that  "machining"  processes  lead  to  sur¬ 
face  finish  involving  numerous  vertical  edges.”  In  other  wards,  the  surface  pro¬ 
file  can  be  viewed  as  a  kind  of  telegrapher's  signal.  However,  this  same  spectrum 
is  also  exhibited  by  Brownian,  Markov  and  autoregressive  processes,  and  its  appear¬ 
ance  in  nature  probably  lies  more  in  the  multiplicity  of  the  processes  that  exhibit 
this  behavior  than  in  a  common  physical  origin. 

3.2  OPTICAL  SUHffiACES.  Optical-quality  surfaces  generally  have  roughnesses 
of  <  1  microinch  and  are  difficult  to  measure  by  mechanical  stylus  techniques. 

An  alternative  method  is  to  measure  the  angular  distribution  of  the  intensity  of 
light  scattered  from  the  surface,  which  is  a  simple  mapping  of  the  two-dlmensi onal 
power  spectrum  of  the  eurface  height  when  the  vertical  roughness  is  much  less  than 
the  radiation wavelength . 

Although  this  art  is  in  its  infancy,  available  data  suggest  that  polished 
optical  surfaces  frequently  exhibit  spectra  which,  when  translated  into  the  one¬ 
dimensional  form,  correspond  to 


Wx(p) 


cc 


-1 

p 


(18) 


This  behavior  has  been  interpreted  in  terms  of  a  surface-tension  model  in  which 
polishing  is  viewed  not  as  a  cutting  process  per  se,  but  a  smoothing  operation 
which  minimizes  the  excess  surface  area  due  to  the  residual  surface  roughness. 
Superficially,  Eq.  (18)  resembles  1/f  electrical  noise;  another  ubiquitous  form 
which  also  derives  from  a  variety  of  physical  processes  in  nature.  Other  inverse- 
power-law  forms  have  been  reported  for  polished  surfaces  as  well.” 

3.3  LIQUID  SURFACES,  liquid  surfaces  exhibit  a  variety  of  height  dlstrlbu- 
tiona  depending  on  the  nature  of  their  excitation.  Simple  forms  for  the  power 
spectral  densities  appear  in  two  limiting  cases t  Capillary  waves,  which  are  gov¬ 
erned  by  surface  tension  and  lead  to  the  hyperbolic  form,  Eq.  (lb);  and  the 
"fully  aroused  sea"  which  exhibits  an  inverse-cube  profile  spectrum.® 
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Ttio  fact  that  mochanical,  polished  and  liquid  surfaces  all  exhibit  approxi¬ 
mate  inverso-powcr-law  spectra  with  integral  (or  half-integral? )  exponents  sug¬ 
gests  a  deeper  physical  connection  between  surface  roughness,  the  properties  of 
the  surface  modium,  and  the  roughening  process.  A  first  cut  at  understanding 
this  interesting  situation  has  been  given  in  terms  of  a  shot  model  of  surface 
roughness  coupled  with  the  asymptotic  properties  of  the  Fourier  integral.® 

3.1>  PRACTICAL  IMPLICATIONS.  The  irmediate  significance  of  these  results 
lies  less  in  their  specific  algebraic  forms  or  their  physical  origins,  than  in 
their  ill-behavedness  —  that  is,  they  can  cause  the  moment  integrals,  Eqs.  (Hi) 
and  (15)  to  blow  up  when  the  power  spectra  in  the  integrand  are  extrapolated  to 
very  small  or  very  large  spatial  frequencies. 

One  can  adopt  two  views  of  this:  The  classical  view  holds  that  this  situa¬ 
tion  is  an  artifact  of  the  measurement  process;  that  if  measurements  were  made 
over  a  sufficiently  wide  range  of  spatial  frequencies  the  spectra  would  ulti¬ 
mately  become  well  behaved  and  their  moments  finite.  The  radical  view  is  that 
intrinsic  surface  parameters  such  as  the  classical  spectral  moments  are  operation¬ 
ally  undefined  —  if  for  no  other  reason  than  that  the  spatial  wavelengths  in¬ 
volved  can't  be  smaller  than  atomic  dimensions  or  larger  than  the  size  of  the 
workpiece  —  and  the  real  world  can  only  be  discussed  in  terms  of  bandwidth- 
limited  values  of  those  parameters.  Further,  that  even  if  "intrinsic"  parameters 
could  be  defined  and  measured  by  some  extrapolation  procedure,  the  results  would 
be  of  no  practical  value  since  the  functional  properties  of  surfaces  —  as  measure¬ 
ment  processes  themselves  —  depend  only  on  a  limited  range  of  spatial  wave¬ 
lengths.  Other  workers  have  described  surfaces  exhibiting  such  ill-behaved  power 
spectra  as  non-stationary.5 

Philosophy  aside,  the  practical  implications  for  surface-finish  measurement 
and  specification  appear  to  be  the  following :9  l)  Surface-finish  parameters  are 
not  intrinsic  properties  —  one  cannot  speak  of  a  "1-micron  surface"  but  only  of 
a  surface  which  exhibits  a  1-micron  roughness  over  a  certain  range  of  spatial 
wavelengths.  2)  The  effects  of  bandwidth  limits  upset  the  classical  relation¬ 
ship  between  the  profile  and  area  moments.  3)  0*e  must  understand  the  details  of 
individual  measurement  procedures  to  be  able  to  specify  the  range  of  wavelengths 
included  in  the  measured  values,  both  for  specificity  and  to  ensure  valid  compari¬ 
son  with  other  measurement  processes.  And  U),  one  must  understand  the  functional 
properties  of  surfaces  in  enough  detail  to  be  able  to  estimate 
the  range  of  spatial  wavelengths  to  be  included  in  any  specifications  for  such 
surfaces. 


Items  1,  2  and  3  are  discussed  at  greater  length  in  the  following  sections 
of  this  paper. 


li.o  EFFECTS  OF  BANDWIDTH  LIMITS. 

ti.l  BANUTIDTH-LDriTED  MOIffiNTS.  Bandwidth -limited  values  of  the  spectral 
moments  are  defined  as 


.  ,pmax 
“namin’ dmax)  "  2J  dPw1<P)pn 
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and 
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(20) 


are  the  minimum  and  maximum  spatial  wavenumbers  included  in  the  definition  or 
measurement.  When  these  limits  go  to  zero  and  infinity  the  bandwidth-limited 
moments  become  the  classical  results,  Eqs.  (lit)  and  (15). 

It. 2  RI NO-SPECTRUM  SURFACE.  The  properties  of  the  bandwidth-limited  moments 
can  depend  sensitively  on  the  bandwidth  limits  and  the  shape  of  the  power  spectrum. 
To  illustrate  the  dramatic  effects  that  are  possible  we  begin  with  an  extreme 
example,  that  of  a  ring  spectrum: 


W2(p)  *  <5(p  -  Pq)  '  ; 

where  5  is  a  delta  function.*  Such  surfaces  could  be  realized  by  superimposing 
a  set  of  sinusoidal  corrugations  with  a  fixed  spatial  wavelength,  dQ  •  2w  /p q,  in 
random  directions  over  the  surface.  A  beam  of  light  striking  such  a  surface  nor¬ 
mally  would  then  be  diffracted  into  a  "ring'1  of  light  at  the  polar  angle  given  by 
the  grating  equation.  Sin  6  -  X  /dp,  where  X  is  the  radiation  wavelength;  hence 
the  name.  The  corresponding  one-dimensional  power  spectrum  is,  from  Eq.  (13a) i 

W^p)  -  2P(^/P2  -  p2  (22) 


for  p  <  pq»  but  zero  otherwise.  These  two  spectra  are  sketched  in  Fig.  2. 

The  striking  difference  in  their  form  is  due  to  the  wave-on-the-beach  phenom- 


*For  simplicity  a  normalizing  factor  with  the  dimensions  of  length-^  has  been 
suppressed  on  the  right  of  F.q.  (21). 
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The  classical  moments  of  the  ring-spectrum  surface  are  given  by 

Mjj  *  2l,Po+1  »  (2U) 

plus  the  moment  ratio,  Eq.  (16).  However,  the  bandwidth-limited  moments  are  very 
different. 

Look  at  Pig.  2,  and  begin  by  considering  the  situation  where  the  spectral 
window  falls  entirely  above  pgj  that  is,  pm^n  >  Pq.  In  that  case  both  the  pro¬ 
file  and  area  moments  vanish  and  the  surface  appears  to  be  perfectly  smooth.  If 
the  spectral  window  is  now  moved  downward  to  include  the  point  p  ■  pg,  that  is, 
Pjnin  <  Po<  pmax>  the  Profile  and  the  area  moments  will  be  nonvanishing,  al¬ 

though  they  only  satisfy  the  classical  moment  ratio  when  p^y,  "  0.  And  finally, 
when  the  spectral  window  ia  moved  entirely  below  pg,  that  is,  pg  >  the  pro¬ 

file  moments  are  still  nonvanishing  but  the  area  moments  are  identically  zero.  In 
other  words:  the  surface  is  rough  to  st;  as  measurements  but  smooth  to  scattering 
measurements  even  though  both  measurements  are  made  over  the  same  range  of  spatial 
wavelengths! 

Although  ring-spectra  surfaces  are  not  encountered  in  conventional  manufac¬ 
turing,  the  point  is  still  well  made  that  bandwidth-limited  finish  parameters  can 
have  very  different  properties  from  intrinsic  parameters  of  the  same  surface: 

Both  their  magnitudes  and  the  relationship  between  the  profile  and  area  proper¬ 
ties  are  affected.  Such  effects  are  examined  below  for  more  realistic  surface 
spectra . 


U.3  PCWER-LAW  SURFACES.  Suppose  the  profile  power  spectrum  measured  over  a 
limited  range  of  "spatial  frequencies  has  the  form 


w!<p)  *  P-8 

where  s  is  a  number.'*'  In  order  to  obtain  the  corresponding  form  of  W2  we  must 
know  Wi  to  infinitely  high  spatial  frequencies.  If  we  take  the  inverse  power-law 
form  shown  to  be  valid  into  that  unmeasured  region,  Eq,  (13b)  shows  that  the  cor¬ 
responding  form  of  the  area  spectrum  is 


W2(p)  «  F(s) • p 


where 


(26) 


F(a) 


r ( { 1/2  > [a  +  1]) 

r<i/2)r({ 1/2  >s) 


(27) 


*For  simplicity  a  normalizing  factor  with  th9  dimensions  of  length^-8 )  has  been 
suppressed  on  the  right  of  Eq.  (25). 
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Given  these  analytic  form:,  for  the  power  spectra  we  can  easily  calculate 
the  magnitudus  of  the  bandwidth-limited  spectral  moments,  Eq.  (IV).  Rather  than 
Giving  the  obvious  algebraic  forms,  values  are  givon  in  Table  1  for  selected 
values  of  s  and  n. 


In  all  cases  the  magnitudes  of  the  moments  depend  explicitly  on  the  minimum 
and  maximum  spatial  wavelengths  included,  and  diverge  in  the  limit  p^j  n  0 
and/or  Pmax  00  •  However,  the  precise  dependences,  and  the  sensitivity  of 
the  results  to  variations  in  the  finite  wavelength  limits,  depends  on  the  case 
considered.  For  example,  for  machined  surfaces,  s  ■  2fl  the  rms  profile  height, 
slope  and  curvature  scale  approximately  as  d-^3*,  and  dpin“3/2,  respec¬ 

tively. While  for  polished  surfaces,  s  ■  1,  they  scale  as  log'^d^ix/dmln)* 
‘Wn*’1  For  case3  of  interest  the  sirpe  and  curvature  parameters 

are  generally  moat  sensitive  to  d^,,,  as  expected  physically. 


I4.I1  PROWLS- AREA  RELATIONSHIP.  The  relationship  between  the  profile  and 
area  parameters  is  determined  by  the  moment  ratio  For  power-law  spectra 
this  ratio  is  simply 


-  - rW2) _  (28) 

Mh  r(i/2)r({i/2}[s  +  1]) 


which  has  the  values  of  1,  2/ir  and  1/2  for  s  *  1,  2,  and  3.  This  result  is  to  be 
compared  with  the  corresponding  classical  expression,  Eq.  (16). 

There  are  a  number  of  interesting  similarities  and  differences:  Both  re¬ 
sults  are  Independent  of  the  bandwidth  limits,  although  this  is  an  "accident"  in 
the  case  of  the  power-law  spectra.  The  bandwidth-limited  ratio  depends  only  on 
the  shape  of  the  power  spectrum,  i.e.  the  parameter  s,  and  is  independent  of 
the  moment  order  —  further  accidental  properties  of  the  power-law  spectra  — 
while  the  classical  result  is  independent  of  the  spectral  shape  and  depends  only 
on  the  order,  n. 

The  simplest  way  of  displaying  the  magnitudes  of  these  differences  is  to 
examine  the  ratio  of  the  surface  moments  derived  from  a  given  set  of  profile 
moments  using  the  classical  and  bandwidth-limited  ratios,  Eqs.  (16)  and  (28): 

Mclassical 

.  mil  +  ?]/2J_  .  r(s/2)  .  . 

Mbv-limiced  r((n  +  l)/2)  r < (e  +  1J/2)  *  ’ 

n 


Values  of  this  ratio  are  given  in  Table  2  for  particular  cases  of  interest.  They 
indicate  that  the  numerical  error  introduced  by  using  the  classical  recipe  is 
moderate  for  the  lower  moments  of  machined  surfaces,  s  •  2,  but  more  significant 
in  other  cases.  Once  again,  however,  these  results  apply  only  to  isotropically 
rough  surfaces  exhibiting  power-law  spectra  to  infinitely  high  frequency. 

If  the  measured  profile  spectrum  cannot  be  extrapolated  to  infinite  fre¬ 
quency,  the  corresponding  form  of  the  surface  spectrum  cannot  be  determined,  and 
the  profile-area  trfnslation  is  impossible.  This  follows  from  the  form  of  the 
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Abal  transformations  which  relate  and  Wg,  Sq.  (13):  W«(p)  depends  on  the 

form  of  W^(t)  for  all  t  >  p,  and  conversely.  This  situation  indicates  the 
essential  role  that  models  play  in  the  discussion  of  surface-finish  measure¬ 
ments  and  specification,  since  it  is  only  through  faith  in  empirical  or  physical 
models  that  one  can  justify  the  extrapolation  procedures  required  for  trans¬ 
lating  profile  data  into  area  data,  or  vice  versa.  The  ability  to  accomplish 
this  is  critical  since  present  finish  standards  are  given  in  terms  of  profile 
measurements . ^ 

I4.5  CLASSICAL  SURFACES.  The  classical  view  is  that  all  power  spectra  are 
naturally  well  behaved  and  only  appear  to  be  ill  behaved  over  a  limited  range  of 
spatial  frequencies.  Or  to  put  it  another  way,  the  ill-behavedness  lies  in  the 
use  of  improper  extrapolation  functions. 

Roughly  speaking,  the  high-frequency  tail  of  the  spectrum  determines  the 
properties  of  the  higher  moments  while  its  low-frequency  behavior  determines  the 
lower  moments.  In  the  case  of  power-law  spectra  discussed  above  a  simple  ex¬ 
trapolation  to  high  frequencies  was  sufficient  to  establish  the  profile-area 
connection,  but  the  classical  moments  still  diverge  either  at  their  upper  or 
lower  limits.  Such  behavior  is,  of  course,  classically  unacceptable. 

A  v-th  order  classical  spectrum  may  be  defined  as  one  whose  classical  pro¬ 
file  moments,  r^,  are  finite  for  all  n  <  2v  .  Thus,  a  first-order  classical 
surface  possesses  a  finite  height  variance,  but  no  higher  finite  moments.  A 
Cauchy  spectrum,  derived  from  a  simple  exponential  autocovariance  function,  is  a 
familiar  example  of  this  type.  A  Gaussian  spectrum,  on  the  other  hand,  is  an 
infinite -order  classical  spectrum:  all  of  its  profile  moments  are  finite. 

To  achieve  finite  classical  moments  the  measured  spectra  mu9t  be  extrapolated 
with  rounding  functions  which  kill  the  divergences,  especially  at  low  frequen¬ 
cies.  However,  those  rounding  functions  will  necessarily  involve  new  length 
parameters  —  such  as  correlation  lengths  —  in  place  of  the  window  wavelengths 
in  the  bandwidth-limited  case.  The  resulting  classical  moments,  although  finite, 
will  then  have  different  (larger)  magnitudes,  and  will  depend  on  different  sets 
of  physical  parameters  than  the  bandwidth -limited  values. 

The  preceding  discussions  were  principally  concerned  with  moment  ratios  in 
which  these  additional  parameters  do  not  appear.  A  detailed  discussion  of  a 
particular  rounding  of  the  low-frequency  tail  of  the  inverse-power-law  spectra  is 
given  in  the  Aopendix. 

$.0  SOURCES  OF  BANDWIDTH  LIMITS.  The  preceding  Section  illustrates  the  im- 
portance  of  bandwidth  limits  in  the  characterization  of  surfaces  with  ill-behaved 
power  spectra.  This  Section  discusses  a  number  of  sources  of  such  limits  in  com¬ 
mon  measurement  processes:  to  emphasize  that  all  measurements  are  inherently 
bandwidth-limited,  and  to  indicate  how  the  magnitudes  of  those  limits  arise  in 
practical  situations. 


$.1  SLiiC TRONIC  rT  Li*  "IKING.  Mechanical  stylus  measurements  involve  drawing  a 
fine  diamond-tipped  stylus  over  the  surface  being  measured,  converting  the  tip 
motion  into  an  electrical  signal,  and  analyzing  the  resulting  time  series  in  terms 
of  the  surface  profile,  Z(x).  However,  the  output  represents  not  the  true  profile 
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but  the  apparent  profile  Z'(x}: 


Z'(x) 


{ Z(x) } } 


(30) 


where  J  denotes  the  Fourier  transform  and  G  is  the  transfer  function  of  the 
measurement  system,  which  is  usually  dominated  by  electronic  filters  or 
"cutoffs".  The  power  spectral  density  of  the  apparent  profile  is  then 


W'(p)  -  MpiP-w^p)  (31) 

where  VJ^(p)  is  the  true  spectrum.  The  form  of  0(p)  depends  on  the  nature  of  the 
filter.  For  example,  for  a  simple  N-stage  RC  bandpass  filter  the  low-  and  high- 
frequency  cutoffs  behave  asymptotically  as  pt2NJ  respectively: 


~  -P  2>*  (32) 

-f 

$• 2  STYLOS  SMOOTHING.  Mechanical  styli  act  as  a  low-pass  filter  since 
they  tend  to  ride  over  height  fluctuations  with  spatial  wavelengths  smaller  than 
the  tip  radius. 

One  measure  of  the  high-frequency  cutoff  that  appears  frequently  in  the 
literature  is  obtained  from  the  model  of  a  circular  tip  riding  over  a  set  of 
sinusoidal  corrugations.  Simple  geometry  then  shows  that  the  requirement  that 
the  tip  track  into  the  valleys  is  equivalent  to  the  statement  that  the  maximum 
undistorted  spatial  frequency  is 


P 


max 


(aR) 


~/2 


(33) 


where  a  is  the  amplitude  of  the  corrugations  (half  the  peak-to-valley  distance), 
and  R  is  the  radius  of  the  stylus  tip.  For  example,  a  ■  1  u  and  R  ■  1  u  require 
that  ~  6  p. 

This  result  can  be  generalized  to  a  randomly  rough  surface  by  replacing  the 
geometrical  condition  by  the  requirement: 

"4  <  R"2  .  (3U) 


which  states  that  the  rms  profile  curvature  must  be  less  than  that  of  the  stylus 
tip.  A  sinusoidal  profile  has  the  spectrum  VJ  -  5  (p  -  p^  ),  which  leads 

precisely  to  the  geometric  result,  Eq.  (33).  On  the  other  hand,  the  spectrum 
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VJ  •  p~^  elves 


P 


max 


'V 


naxi 1/ 3 


(35) 


where  o2  Is  the  height  variance  and  dj^  is  the  maximum  spatial  wavelength  in¬ 
cluded  in  its  measurement.  For  example,  R  *  1  ji,  a  ■  1  p  and  d^y  •  10^  give 
diirj n  ~  8  p.  Similarly,  for  a  spectrum  of  the  form  W  »  p-1 


P  ^ 
roax 


(36) 


where  log  «  ln(dTnfly/dnrin).  For  the  same  parameters  as  above,  this  gives  dp^w  **  3  p< 

Interestingly,  although  each  of  these  three  examples  gives  values  of  the  min¬ 
imum  undistorted  spatial  wavelengths  which  are  of  the  same  order  of  magnitude  as 
the  tip  radius,  none  of  the  expressions  derived  predicts  proportionality  between 
dmin  and  Also,  the  non-lntuitive  form  of  the  factors  in  Eq.  (36)  is  more 
apparent  than  real:  the  quantity  log/o2  is  a  simple  constant,  as  seen  from 
Table  1. 


$.3  APERTURE  SMOOTHING.  Optical  stylus  measurements  produce  an  apparent 
profile  which  is  a  smoothed  version  of  the  true  profile.  In  one-dimensional 
terms 


Z'(x)  «  dcw(t  -  x)Z(t) 

. 


(37) 


where  u>  is  the  window  function  of  the  apparatus.  The  power  spectral  density  of 
this  smoothed  profile  is  then 


W|(p)  »  0<p).W1(p) 


(38) 


where  is  the  true  spectrum  and 

fl<p)  -  | J  dxeipxu(x) |2 


(39) 


is  the  window  transfer  function.  Such  smoothing,  of  course,  acts  as  a  low-pass 
filter  with  the  nominal  cutoff 


P  ^ 
max 


2s /A 


(Uo) 


where  A  is  the  width  of  the  smoothing  window.  'Hie  precise  form  of  the  transfer 
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function  depends  on  the  window  shapet  A  Oaussian  window  f»ives  a  Gaussian  0; 
a  rectangular  window,  a  Sine2  function}  a  cylindrical  window,  an  Airy  function, 
and  so  on. 

SJj  SURTBOARDIMQ.  Surfboarding  arises  from  the  fact  that  profile  measure* 
ments  are  usually  made  with  reference  to  a  local  rather  than  an  absolute  base¬ 
line.  That  la,  one  usually  measures  not  the  true  profile,  2(x),  but 


Z' (x)  -  Z(x)  -  (a  +  bx) 


Oil) 


where  the  term  in  parentheses  represents  the  least-squares-average  line  through 
the  individual  records,  The  apparent  power  spectrum  is  then  a  convolution  of 
the  true  spectrum  of  the  form^»^« 


W(p)  -  J  |  dqU  -  H  -  C)2Wl(q) 


<fc2) 


where  the  three  functions  in  the  kernel  are 


C  •  J0(pL  -  qL) 


(li3a) 


n  -  Jo(pL)j0(qL) 


U3b) 


C  -  Sjj^pL)  jx(qL)  (l<3c) 

and  the  j's  are  spherical  Bessel  functions.  The  5  term  corresponds  to  the 
finite  record  length,  2L;  the  n  term  to  the  removal  of  the  average  from  each 
record}  and  the  ?  term  to  the  removal  of  the  least-squares  slope  from  each  in¬ 
dividual  record  in  the  ensemble.  In  the  limit  of  very  large  record  length  the 
kernel  becomes  a  6  function  and  W‘  W  for  p  >  0,  as  expected.  Numerical  and 
analytic  evaluation  of  Eq.  (1*2 )  shows  that  surfboarding  acts  as  a  high-pass  filter 
which  cuts  off  spatial  frequencies  below 


^min 


Tt/L 


<Wi> 


where  the  shape  of  the  cutoff  depends  on  the  form  of  W. 

The  nature  of  this  effect  is  sketched  in  Figure  3.  It  is  called  surfboarding 
because,  in  effect,  the  baseline  for  a  finite  record  length  "surfboards"  over 
the  surface  profile  and  measurements  relative  to  that  baseline* are  Insensitive  to 
very  long  spatial  wavelengths.  In  fact,  it  is  readily  seen  from  Eqs.  (1*2)  and  (1*3) 
that  the  apparent  power  spectrum  vanishes  at  zero  spatial  frequency. 
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SUKFACE  SCATTERING.  Light  and  acoustic  scatterinc  offer  a  means  for 
the  direct  measurement  of  the  two-dimensional  power  spoctra  of  the  surface 
rouchness,  Wg.  In  such  measurements  tho  scattering  angle  is  related  to  the 
surface  spatial  wavelength,  d,  through  the  familiar  grating  equation: 


|sin6g  -  sin6i|  ■  X/d 


U5) 


where  and  eg  are  the  polar  angles  of  incidence  and  scattering,  respectively, 
and  X  is  the  radiation  wavelength.  Ibis  equation  follows  from  the  spatial 
invariance  of  the  electromagnetic  or  acoustic  equations  and  is  independent  of  the 
details  of  any  specific  scattering  theory. 

In  practice,  the  scattering  angle,  6a,  is  limited  to  a  maximum  of  v/2  — 
glancing  scattering  —  and  a  minimum  of  |8a  -  ^  X  /L,  where  2L  is  the  dia¬ 

meter  of  the  illuminated  surface  area.  Interestingly,  there  are  at  least  three 
mechanisms  that  determine  this  minimum  scattering  angle:  lhe  diffraction  limit 
corresponding  to  the  finite  illuminated  surface  aperture;  the  requirement  that 
the  maximum  spatial  wavelength  be  smaller  than  tha  record  length  to  ensure  sta¬ 
tistical  stability  of  the  measurements;  and  finally,  surfboarding,  as  described 
above.  This  last  enters  through  the  fact  that  the  record  mean  and  slope  are 
automatically  removed  in  aligning  the  scattering  apparatus  with  rsference  to  the 
centroid  of  the  specular  reflection  for  each  sample  area. 

For  the  usual  case  of  near-normal  incidence  the  spatial  bandwidth  limits 
involved  in  scattering  measurements  are 


2n  >  >  2n 

r  -  p  -  r 


U6) 


or  equivalently,  d^^  X  and  /v»  L.  Practical  limits  generally  fall  well 
within  these  extremes.  For  example,  for  a  beam  of  red  HeNe  laser  light  1  mm  in 
diameter,  d^  ~  1  p  and  d,^  100  y. 

6.0  SUMMARY,  CONCLUSIONS  AND  RECOMMENDATIONS.  Classical  discussions  of 
surface  topography  assume  that  the  power  spectral  densities  of  the  profile  height 
Wi(p),  the  profile  slope  p^Wj.(p),  and  the  profile  curvature  p^W^p),  are  Int#-  • 
grable  over  spatial  frequencies  from  zero  to  infinity.  This  leads  to  the  satis¬ 
fying  picture  that  the  rms  surface  height,  slope  and  curvature  are  finite, 
intrinsic  surface-finish  parameters;  and  further,  that  there  i«  s  one-to-one 
connection  between  profile  properties  and  those  of  the  surface  area. 

However,  there  are  two  eerpents  in  this  Eden  of  simplicity:  Measurements 
and  functional  properties  of  surfaces  are  sensitive  to  only  United  ranges  of 
surface  epatial  wavelengths,  and  many  surfaces  display  spectra  that  lead  to  n on- 
in  te  grable  power  spectral  densities  when  extrapolated  to  very  low  and/or  very 
high  spatial  frequencies. 

As  a  result: 
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1)  Tho  range  of  spatial  wavelengths  must  be  included  in  measured  or 
specified  finish  parameters?  for  specificity*  to  ensure  valid  comparison  with 
other  measurements,  and  for  functional  applications. 

2)  Measurement  techniques  must  be  analyzed  to  determine  the  range  of 
spatial  wavelengths  which  they  include?  or  better  said,  their  frequency  charac¬ 
teristics  or  transfer  functions. 

3)  Functional  properties  of  surfaces  must  be  examined  to  determine  the 
range  of  apatial  wavelengths  of  importance.  In  the  case  of  electromagnetic  and 
acoustic  scattering  this  function-finish  relationship  is  well  known,  but  in 
other  cases,  such  as  fricticn  and  wear,  the  connection  is  not  as  clear. 

Li)  Physical  models  should  be  developed  for  the  finish  generated  by  various 
manufacturing  processes.  These  models  are  necessary  for  the  further  understand¬ 
ing  of  the  generation  process,  the  simplification  of  the  specification  process, 
and  the  extrapolation  of  measured  spectra  into  unknown  regions.  Such  extrapo¬ 
lation  is  necessary  for  the  broadening  of  the  range  of  application  of  various 
measurement  techniques  and  for  relating  profile  and  area  specifications. 

$)  Bandwidth -limit  effects  discussed  in  this  paper  should  be  extended  to 
other  important  issues  involved  in  the  measurement  end  specification  processes 
which  have  been  omitted  in  the  present  discussion  t  such  as  two-dimensional 
measurement  techniques,  anisotropic  surfaces,  sampled  data,  and  statistical  sta¬ 
bility.^ 

7.0  APPBKDIX.  This  Appendix  describes  a  classical  trio  of  second-order 
statistical  /unciions  which  behave  as  the  inverse  power-law  spectra  discussed  in 
the  text  at  high  spatial  frequencies,  »  p~*,  but  which  possess  finite  spectral 
moments,  for  all  n  <  s  -  1.  These  functions  are  related  by  the  Fourier, 
Kankel  and  Abel  transformations  illustrated  in  Pig.  1. 

The  autocovariance  function  is 


COO  »  (at)  (U7) 

'  flr(s/2)  '“2a;  K[s  -  1J/2 V  }  1  ' 


where  r  is  the  gamma  function  and  K  ia  the  modified  Beesel  function.  DB 
is  a  constant  having  the  dimensions  of  length  to  the  (3-s)  power,  and  a  is  a 
constant  with  the  dimensions  of  reciprocal  length?  1/a  is  the  "correlation 
length"  for  this  class  of  functions. 

The  corresponding  one-dimensional  power  spectral  density  —  the  cosine 
transform  of  C  —  is 

Wi(p)  -  Ds(p2  +  a2)'fl/2  .  <1j8) 


The  two-dimensional  power  spectral  density  —  the  zeroth-order  Hankel  transform 
C  —  is 
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(1*9) 


W2(p)  -  DgF(s)  •  (p2  +  a2)~l8+l]/2 


where 


FC»)  “  ,  j)  .  (50) 


and  B  is  the  beta  function 


B(o,B)  -  r(o)r(6)/r(a  +  6)  .  (51) 

In  the  limit  p  a  —  that  is,  for  spatial  wavelengths  much  shorter  than  2  ir/a  — 
these  spectra  become  the  simple  power-law  forms  discussed  in  the  text*  Bqs.  (25) 

-  (27). 

Hie  classical  moments  of  the  spectra,  Eqs.  (1*8)  and  (1*9),  are: 


^  „  ,n+l  s-n-lv  _n+l-a 

VB(~  ,  2~)*« 


(52) 


and 


M 

n 


(53) 


The  ratio  of  the  classical  profile  moments  to  the  corresponding  bandwidth-limited 
values  is  then 


classical 

m  .  ad 

n  _  ,n+l  a-n-ly  ,  max^n+l-s 

bw-limited  8(  2  '  2  '  (  2ir  5  '  (5U) 

IB 

n 


in  the  limit  d-^^  ^min*  ratio  indicates  the  conjugate  roles  of  the  cor¬ 

relation  length  and  the  maximum  spatial  wavelength  in  determining  the  magnitudes 
of  the  classical  and  bandwidth-limited  moments,  and  the  fact  that  the  former 
moments  are  much  larger  than  the  latter. 

Two  special  cases  of  the  above  general  results  are  of  particular  Interest) 
s  •  1  and  s  •  2. 
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For  8  ■  It 


C(T)  -  20^(81) 

Wx(p)  -  D^p2  +  a2]"172 
W2(p)  -  (01/it)tp2  +  a2)"1 

and  for  s  •  2i 

C(t)  »  v (Dj/ a)e”aT 
W1(p)  -  D2[p2  +  a2]"1 

W2(p)  -  (D2/2)[p2  +  a2]"3 


(55a) 

(55b) 

(55c) 

(56a) 

(56b) 

(56c) 


These  two  cases  represent  zeroth-  and  first-order  classical  surfaces,  respectively. 
In  the  sense  defined  in  Section  2*. 5* 
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Figure  1.  Relationships  betweon  the  three 

second-order  statistical  functions 
C,  W]_  and  v/j>  of  the  random  vari¬ 
able  ?..  'J-  ,  (K.  and  0(.  stand  for 
the  Fourier,  Hankol  and  Abol 
transfoms  discussed  in  the  text. 
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Table  1.  Bandwidth-limited  values  of  the  profile 
and  area  moments  corresponding  to  the 
profile  spectra  shown*  Results  are 
given  in  the  limit  d,,^  »  d_^n.  Here 
log  ■  In  (dnjax/d^j,).  Dimensional  con¬ 
stants  have  been  suppressed. 
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1.  Introduction  and  Summary 

A  problem  of  fundamental  interest  to  practitioners  in 
reliability  is  the  statistical  estimation  of  the  reliability  of  a 
system  using  experimental  data  collected  on  subsystems.  In  this 
paper,  the  subsystem  data  available  consists  of  a  sequence  of 
Bernoulli  trials  in  which  a  "one"  is  recorded  if  the  subsystem 
functions  and  a  zero  is  recorded  if  the  subsystem  falls.  Thus 
for  each  of  the  k  subsystems  composing  the  system,  the  data  pro¬ 
vided  consists  of  the  pair  (n^.T^),  1*1,2,.. .k,  where  Y^  is 
binomially  distributed  (n^p^.  We  assume  that  ’^2  ’  ‘  ‘  *^k  are 

mutually  independent  random  variables. 

The  magnitude  of  interest  in  this  problem  is  easily  evidenced 
by  the  extensive  literature  devoted  to  it.  In  this  regard,  see 
the  survey  paper  by  Harris  (1977)  and  Section  10.4  of  the  book  by 
Mann,  Schafer,  and  Slngpurwalla  (1974).  In  addition,  the  Defense 
Advanced  Research  Projects  Agency  has  recently  issued  a  Handbook 
for  the  Calculation  of  Lower  Statistical  Confidence  Bounds  on 
System  Reliability  (1980). 

Historically,  the  first  significant  work  on  this  problem  was 
produced  by  Buehler  (1957).  However,  Buehler’s  method  as  des¬ 
cribed  in  that  paper  i6  difficult  to  Implement  computationally 
when  k>2. 

We  proceed  by  describing  Buehler’s  method  in  Section  2.  In 
Section  3  we  specialize  to  series  systems,  that  is,  a  system  which 
fails  whenever  at  least  one  subsystem  fails.  Sudakov's  (1974) 
results  are  extended  in  Section  4  and  employed  to  exhibit  some 
optimality  properties  of  the  L i nd s t r om-Madd en  method  (see  Lloyd 
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and  Llpov  (1V62))  for  constructing  lower  confidence  bounds  for 
the  reliability  of  series  systems  of  stochastically  Independent 
subsystems.  Some  numerical  examples  are  given  in  Section  5  and 
the  results  needed  for  this  generalisation  of  Sudakov's  Theorem 
are  provided  In  the  Appendix  to  this  paper. 

2.  Buehler's  Method  for  Lower  Confidence  Bounds 

A  system  composed  of  k  Independent  subsystems  is  said  to  be 
a  coherent  system  (with  respect  to  the  specified  decomposition 
into  subsystems),  if  the  system  fails  when  all  subsystems  fall 
and  the  system  functions  when  all  subsystems  function;  and  replac¬ 
ing  a  defective  subsystem  by  a  functioning  subsystem  can  not 
cause  a  functioning  system  to  fail.  Coherent  systems  are  des¬ 
cribed  in  Birnbaum,  Esary  and  Saunders  (1961)  and  Barlow  and 
Proschan  (1975) . 

To  any  system  one  can  associate  a  function,  h(p)  - 
h(Pl,p2* * ’ * ,pk^ *  °^pi-1’  • • • ,k,  where  h(p)  is  the  reliabil¬ 

ity  of  the  system  when  Pj^  is  the  probability  that  the  i— 
subsystem  functions.  It  is  well-known  that  if  the  system  is 
coher  ent , 

0  <  h(p)  <  1  , 

h ( 0 . 0)  -  0,  h(l . 1)  -  1  , 

and  h(pj,...,pk)  is  non-decreasing  in  each  variable. 

For  coherent  systems,  Buehler's  method  may  be  described  as 

follows:  The  observed  outcome  (y.,...,y.)  can  assume  any  of 

k  lk 

N  -  II  (n  +1)  values,  since  y  »  0,1,..  ,,n..  For  convenience, 

1*1  1  1  1 
ve  denote  n  ^  -  y  ^  by  *  ^ »  1*1,2,.. .,k. 
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A  partition  (A^ , A^  ,  .  .  .  ,Ag)  ,  s>l,  of  the  N  possible  outcomes 
is  said  to  be  s  monotonic  partition,  that  is,  ^i<A2<'*'<^s  ** 


(0,0,...,)  c  i  (n^^n^i 


,nfc)  c  Afi  and  if  x^  -  (x^^, • • • ,*lk> » 


*  ^X21  ’  *  *  * vith  Xj  ^  ^  *21* 


2k 


li 


‘21 


,k,  then  Xj  t  A^ 


implies  x,  C  Aj.  J  >  i. 


Let 


f  (x;  p)  -  p-(X-x)  -  II 
p  i-1 


k  /n^  n,'*,  x 


Pi1  S/  -  n 

1  1  i-1 


k 


Pi  ^i 


(2.1) 


and  for  1  <  n  <  s-1,  let 

an  "  inf{h(P> 


I  f(x  ;p)  -  aj 

.  <  <n  1  / 


(2.2) 


and  a 


0. 


xiEAi» 


Each  such  partition  may  be  identified  with  a  function  defined  on 
the  set  of  sample  outcomes  by  defining  the  ordering  function 
g  (x ) ,  where 

g ( x )  -  n  if  x  G  An  ,  lfn<s  J  (2.3) 

obviously  g(x)  inherits  the  monotonicity  properties  of  the 
partition. 

Subsequently  it  will  be  convenient  to  use  ordering  functions 

g(x)  6uch  that  the  range  of'  g(x)  will-be  a  finite  set  of  reel 

numbers,  r,<r,<.  .  .<r  .  With  no  loss  of  generality,  we  can  identify 

12  S 

A  by  defining  A ^  -  jx|g(x)  -  r^J,  i"l»2,...f«. 


the  sets  A 
now  establish  the  following  theorems. 


We  can 


Theorem  2.1.  Let  %  be  distributed  by  (2.1).  Then  a  is  a 

- _  g 


(1-a)  lower  confidence  bound  for  h(p) 


If  is  *)®o  *  (1-a) 


lower  conf id enc e  bound  for  h(p),  then  bj<s^,  l<i<®. 


Proof:  Fix  p  and  let  n(p)  be  the  smallest  integer  such  that 
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f  n(p)  ) 

'tV  c  ±vmX  Ai  1  - 


(2.4) 


pH  E  V  .  A  }  >  1-C 
Pl  i-n(p) 


(2.5) 


».  ■  (5|pp{5  E  j/J  -  “} 


(2.6) 


Then  D  is  a  1-a  confidence  sec  for  p,  since 

gV  X) 


c  Dg(x)}  “  pp{8(i)  -  n(p)}  -  1'a  * 


(2.7) 


This  establishes  the  first  pare  of  the  conclusion.  Further,  since 

h(p)  is  continuous  and  0<p^<l,  the  infimum  in  (2.2)  is  attained. 

Now  assume  that  1.  is  the  smallest  index  such  that  b.  >a  , 

11 

l<i  <s-l.  Then,  for  some  pQ,  p^, 

bi  >  inf|h(p) |  J  f(x;p)  -  a}  -  h(po)  » 

1  *iCAi,iiil 


I  fCx.P,)  >  a,  h(p.)  <  b 

xicA1,i<i1  x  X1 


Therefore 


)  <  b 


g  no 


x1cAi ,  i<  i^ 


f (xjPj)  >  a  , 


a  contradiction. 


Remark.  Let  d  •  aup(l-h(p)|  J 

*  |xieAi>i<n 


l  f (x.;p)  -  a). 
.  .  i<n  1  > 


Then  d  is  s 
n 


(1-a)  upper  confidence  bound  for  l-h(p),  the  unreliability. 

Let  A  -  |xcEk,  0<xi<a1,  i-l,2,...,k|  and  let  g(x)  be  contin¬ 
uous  on  A  (the  closure  of  A)  and  strictly  increasing  in  each 
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lor  «A.  «(S)  •»  »  >>«  “  *»  "4"i"S  ‘U"Ct‘°'' 

«,  described  lmmedi.tely  preceding  Theorem  2.1.  We  require  the 

following  additional  property  of  g(x). 

Fix  x0eA*  Let  B(*0)  41  SC*!*0*- •  *  *0)  "  *1*  Then 
g(Ji,0 . 0)  -  s(i0)  hee  e  unique  eolutlon  In  Pj-  Proceeding 

recursively,  let  »,  «  ^  «4  *•*»«  *2  ‘  »,«»>  **  th*  ,0lUtl°“ 

of  g(i0)  -  gdg.yj.o . o)-  ‘•c>‘  liJiv  *n'1  S-i  i  yi-v 

.  <  v  i  <  y,  ,  let  y.  ■  y  s  (i«  » lo  *  *  *  *  ’  -1^  t*'e 

j  -2  -  yj  -2’  ’  “  ’  1  -  1  J  }  * 

solut Ion  of 


g(Jc)  -  * 

We  require  that  the  equations  indicated  in  (2.8)  have 
unique  solutions  for  each  y^ . 


(2.8) 


Then  define 


l yx 3  ly2) 


|ykl 


^  Ik 

F(x  ;p)  m  1  I  •••  l  f(l»5)  * 

o  j  -  a  i  .  a  d  ■  n 


(2.9) 


ij-0  i2-o 


ik-o 


where ,  for  J>1.  . 4j -1*  *  L*C 


f*(J  :a)  -  »«P  F(5  ;p)  .  o<s<i 

0  h(7)-a 


(2.10) 


Then  we  have 

Theorem  2 . 2 .  U  i„  s.tl.fle.  ^  £*<V>  ’  »•  ‘V>  ’  1 

and  f*(xo;a)  is  a  strictly  increasing  function  of  a,  and  if 


x  e  A  where  g(x)  determines  <AX , , . . . , Ag) ,  and  if 
o  n 

b  -  inf (h (p) J  l  f  (*.,5)  *  Of  » 

1  I  x  e  A  .  ,  i  <n  1  J 


then  we  have 


f  *  (x  :  b)  -  O  . 
o 


(2.11) 
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proof :  Since  the  infimum  in  (2.11)  is  attained,  there  is  a  pft 

such  that  b  -  h(p0)  and  F(*0»P0>  -  a.  Then  f*(*o,b)  >  a.  If 

f*(x  , b)  >  a,  there  exists  P, ,  with  *  -  h(p  ),  a<b  and 

o  *  ■ 

f*(x  ;a)  *  a  contradicting  (2.11). 
o 

Obviously,  the  above  discussion  can  easily  be  modified  to 

obtain  upper  confidence  bounds  on  the  unreliability  l-h(p)  by 

*  _ 

replacing  inf  by  sup  in  (2.11)  and  requiring  that  f  (xQ;a)  be  a 
strictly  decreasing  function  of  a,  0<a<l. 

3.  Applications  to  Series  Systems 

k 

For  a  series  system  h(p)  “  II  p..  Further,  throughout  this 

i-1  1 

section  we  assume  that  g(x)  satisfies  the  conditions  necessary  to 
Insure  that  the  solutions  for  ylt...,yk  Indicated  in  (2.8)  are 
unique.  Then  we  have  the  following  theorem. 

Theorem  3.1.  If  h(p)  -  II  p,  ,  then  inf  f  (x_;a)  -  0, 

-  i-i  1  0<a<l  ° 

sup  f  (x  ,a)  -  1  and  f*(x  ;a)  is  strictly  increasing  in  a, 

0<a< 1  °  ° 

whenever  xo  -  (xq1 , . . . , xQk)  satisfies  xeJ  <  n^  ,  J*l,2,...,k. 

Proof .  Since  h(p)  •  1  if  and  only  if  p^  -  1,  i-l,2,...,k,  it 
follows  from  (2.1)  that 

lim  sup  F(x  ;p)  ■  1  . 
a-*-l  h(p)-a  0 

Similarly,  h(p)  ■  0  if  and  only  if  at  least  one  p^  "  0, 
i-l,2,...,k.  Since  F(xo;p)  <  p^|Xi<nij  *  1"Pp{Xi"ni}  “  1"qi1, 
we  have 

11m  sup  F (x  ; p)  »  0  . 
a-*-0  h(p)*s 

* 

To  show  that  f  ( x  o ;  a )  is  strictly  increasing  in  a,  consider 
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0<a<b<l  and  let  pfl  -  (Pal » •  •  •  *PaV>  satisfy  f*(xo;a)  -  F(*0»Pa>’ 

-  *  -  -  » 

Similarly,  let  p.  satisfy  f  (x  :b)  •  F(x  :p.).  Let 

D  O  0  D 

1  *  » *2 ’ *  *  *  * any  non_e“Pty  set  of  indices  such  that 

p  (— <  1  and  let  IC  be  the  remaining  indices.  Then 

ai .  a 


(  H  p 

JCI 


el 


j 


)  n 

J  elc 


b  . 


(3.1) 


From  the  monotone  likelihood  ratio  property  of  the  binomial 
d 1st  r ibut ion , 

F<*o:?a)  <  F(5o:f*)  » 

* 

where  the  components  of  p  are  given  by  (3.1).  Then 

f(x  ;p*)  <  sup  f(x  ;p)  -  "  f*(x„;b)  . 

O  —  ^  o  O  D  O 

h (p) «b 


4.  Sudakov's  Method 


Let 

Vr>,)  "  5T7T7T  I]  • 

Then  if  y  is  an  integer,  y<n,  we  have 


v  /n\  n-ii  _  . 

L  (*)  P  q  -  I  (n-y,y+l)  . 

i-0  1  P 

For  0<y <n ,  real,  define  u(n,y,a)  by  a  -  I  .  v(*j-y,y+l). 

Thus,  for  integer  values  of  y,  u(n,y,o)  is  a  lOO(l-a)  percent 

lower  confidence  limit  for  p.  Sudakov  (1973)  showed  that  for 

k 

n  <n  <...<n  and  g(x)  -  II  (n  -x.), 

A_  ~  i«l  1  l 


u(n1,y1,ot)  <  b  <  u(n1,Iy1),ft)  , 
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vher  e 


i  ■  w  ’o  ■  1  -  lH1((nr,<oi)/,'i)  • 


u(n^,7^,a)  is  called  the  Llndstrom-Madden  method  for  determining 
lover  confidence  limits  for  the  reliability  of  series  systems 
(see  Lloyd  and  Lipov  (1962)). 

Lipov  and  Riley  (1959)  used  a  different  ordering  function; 
nevertheless  they  noted  that  for  "small"  n^,  their  tabulated 
value6  provided  good  agreement  with  the  results  using  the 
Llndstr om-Madden  method.  For  large  values  of  n^,  the  tabulated 
values  that  they  provided  are  based  on  the  Llndstrom-Madden 
method.  Here  ve  provide  a  further  justification  for  the 
Llndstrom-Madden  method  by  establishing  that  it  provides  conserva¬ 
tive  lower  confidence  limits  (l.e.  is  a  lover  bound  to  b  defined 
in  (2.9))  using  the  ordering  function  g(x)  employed  by  Sudakov 
and  ve  also  obtain  an  upper  bound  for  b,  thus  determining  the 
possible  error  of  the  Lind s tr om-Madd en  method. 

Sudakov's  proof  is  unnecessarily  complicated  and  contains 
some  Incorrect  assertions,  which  nevertheless  do  not  affect  the 
validity  of  the  conclusion.  In  the  Appendix  ve  provide  a  simpler 
proof  of  some  auxiliary  results  needed  for  the  generalization  of 
Sudakov's  theorem  given  below. 


Theorem  4.1.  Let  g(x)  satisfy  the  hypothesis  of  Theorem  3.1. 

Then, 

b  <  min  u(n  ,[y*),a)  ,  (4.1) 

1< i<k  1  1 

where  b  is  given  by  (2.11)  and  y*  -  y±  Oj  » $  2  »  *  *  *  i-i>  i#  evaluated 
at  J  ^-O ,  i“l , 2 , . . . , i-1 .  Note  that  y^  ■  y£.  If  we  also  have 
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h'-h  >  ini  . 

°j'*j  '  "j+i 


then 


u(ni.71,o)  <  b 


(4.2) 


(4.3) 


Proof :  (4.1)  Is  Immediate  from  (2.11)  upon  setting  p^-1,  *t»d 

solving  F (xQ ; 1 . 1»P^»1**».»1)  *  o •  Recall  that  n^  ^  Oj  5  •••  ^  ® 

and 


lyj 

F(x  ;p)  -  I  b (n. -1. ; p. , n. 
°  l1-0  x  x  a  a 


) 


<yk-lJ 


.  b(nk-i'1k.i;pk-i»nk-i)IP.  (tvlyk]’[yk]+1) 

ik-i"° 


...  I 

Nov,  apply  Lemmas  Al,  A2,  and  A3  to  the  Innermost  sum  In  (4.4),  to  g< 
Iyk-iJ 

I  b(nk-r1k-ii,’k-i'”k-i,Ipk <n^-I^'k)•|5,k]+1,  i 

ik-l”U  K 

‘^k-i1 

£  b<nk  l'1k  l  ’  pk  l»Dk  l^p  ^nk”yk,yk+1^  < 

‘k-r0  k 

lyk-l’ 

J  b ^nk-l_ik-l ' Pk-1 ’ °k-l^ 1  ^nk-l”yk-l ,yk-l-ik-l+^^  - 

l  *0  pk  * 

xk~l 

I  _  (n.  .  -y.  .  ,y v  .+1)  • 
pk-lpk  k~*  k‘1  k  1 

Repeated  applications  of  the  above  establish  that 


f(xq;p)  <  i  k  (nj-y^yj+D  • 

n  ?i 

i-i  1 


(4.5) 


(4.3)  follows  immediately  from  (4.5),  completing  the  proof 
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.  -  ,‘ki  f  j  jji i  ■  H'  mi/~  irt— ... 


Remarks . 


If  (A. 3)  holds  and  is  an  integer,  then  b  -  f(n^ty^,o). 

It  has  often  been  suggested  (Lloyd  and  Llpov  (1962), 
Winterbottom  (1974),  Bolshev  and  Lo ginov (1966) ,  Mirnly  and 
Solov’yev  (1964))  that  the  confidence  level  should  depend  only  on 
n7 ,  the  smallest  sample  size.  We  now  provide  a  numerical  Illus¬ 
tration  to  show  that  the  bound  in  (4.1)  may  be  improved  by  taking 
all  the  n^'s  into  consideration. 

Let  k-3,  a-.l,  n  -  (10,12,30),  x  «  (0,3,0).  Then  for 
3 

g(x)  -  n  (ni-x1)  ,  {(n^^.Iyjl.o)  -  .541,  f(n2»(y2),a)  -  .525, 
f(n^ly3l»a)  *  .639.  The  use  of  (4.3)  establishes  .500  <  b  <  .525. 

Note  that  if  x  .  •  n . ,  for  some  i,  l<i<k,  then  g(x)  -  0  and 
b* 0 .  It  seems  reasonable  to  use  b- 0  as  the  lover  confidence 
limit  whenever  xo^“n^  ^or  any  monotone  ordering  function  satisfy¬ 
ing  the  conditions  of  Section  2. 

k 

We  now  show  that  if  g(x)  •  II  (n.-x.)f  then  (4.2)  is  satis- 

i-1  1  1 

fied  and  Theorem  4.1  applies.  This  result  will  extend  a  result 
due  to  Wlnterbottom  (1974),  who  established  thla  fact  for 
particular  special  cases.  In  addition,  we  will  also  show  that 
(4.2)  holds  for  a  number  of  other  ordering  functions  used  in  the 


literature. 


Theor em  4.2.  Let  g(x) 


ni+lai  -  ai+lni’  1-1,2 


k 

II  (n.-x.+a.),  where  a  >0  and 
i-1  1  1 

• ,k-l.  Then  (4.2)  is  satisfied. 
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then  we  have 


(ni’yi+ai)(ni+l+ai+l)  “  (ni_ki+ai) t®i+l“,i+lfoi+l*  » 


establishing 


yl+l 

Di+1  (ni+l+Qi+l) (ni“ki* 


Thus  (A. 2)  holds  if 


^ni+i+ai+i^ni"ki* 

this  last  inequality  will  be  true  whenever  n^+^ai  >  a^^n^.  *n 
particular,  this  is  valid  when  a^«0,  i  ■  k  which  Is 

Sudakov's  ordering  function. 

k 

Theorem  4.3.  If  g(x)  •  1  -  £  x./n  ,  then  (4.2)  is  satisfied. 

i-1 

1c  y 

Proof.  If  1  -  y./Dj  “  c  “  1  -  —  -  —  *-■ ,  then 
-  i  i  »i  "i+1 


or 


y 

n 


i+1 

i+1 


This  type  of  ordering  function  has  been  employed  by  Pavlov 
(1973),  for  example. 

k  2  i 

Theorem  4.4.  Let  g(x)  ■  Y  a  x  +z  (a  x.)  ,  where  z  satisfies 

iloli  a 

1  ~  4>  ( at  )  “  a  and  $(x)  Is  the  standard  normal  distribution  function. 

a  k 

R  —1 

>  a2  >•  •  mt  and  J  1/n^)  .  Then  g(x)  satisfies 

(4.2)  if  and  only  if 
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<«.*) 


(Wl)yJ  “  (VaJ+l)*a+*jkj“aJkj(*a*J+2c"aj(yj+kj>>  ‘ 

jv1  iv1  2 

Proof :  If  g(x  )  ■  c  +  l  a  k  ,  then  defining  £  a  k  •  c., 

'  “  °  i-1  i-1  11  A 


•jyj  +  la(cl+aJyj>*  “  c  (A . 7) 

end 

*jkj  +  *j+iyj+i  +  *a(ci+*jkj+*j2+lyj+i)i  •  c  •  <*•'> 

Equating  the  left  hand  sides  of  (4.7)  and  (4.8),  we  obtain  (4.6). 
If  k-2,  (4.6)  holds  for  all  cases  of  Interest. 

If  (4.6)  holds,  then  setting 


1-a 


<r(*)) 


X|f (x,l-a) 
0 


x-1  -t 
t  e 


dt 


» 


a  straightforward  limiting  argument  shows  that 


max  a1f ( [y1J+l, 1-a)  <  b  <  a x f (yx+l , 1 -a)  .  (4.9) 

Thi6  ordering  function  has  been  used  by  Johns  (1976)  and  b  in 
(4.7)  is  the  value  tabulated  by  Johns  for  k-2.  The  validity  of 
the  lower  bound  does  not  depend  on  (4.6).  In  Table  1  below,  the 
lower  and  upper  bounds  given  in  (4.9)  are  tabulated  along  with 
the  values  given  by  Johns  for  a-.l.  These  refer  to  upper  confi¬ 
dence  limits  for  the  Poisson  parameter  combinations  ai^i+*2^2* 

Note  in  particular  that  three  of  the  values  tabulated  by 
Johns  (indicated  by  asterisks)  violate  (4.9).  Specifically  consider 
5.24,  in  which  case  I y x 3  -  5,  since  g*(2,5)  -  4.78,  g*(5,0)  -  4.72 
and  g  (6,0)  —  5.48.  Using  the  Poisson  approximation  we  obtain  the 
value  9.275  for  the  upper  confidence  limit  to  X  for  a- . 1  and  thus 
al*l+a2^2  *  5.56.  Consequently  the  sup  must  exceed  5.56.  An 
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alternative  approach  to  the  one  suggested  by  Johns  for  k  >  3  is  to 
simply  use  a^f (y ^+1 , l-a)  for  b. 


Table  1 


Comparison 
With  Values 

of  Upper 
Tabulated 

and 

fey 

Lower  Bounds 
Johns  for  o« 

.1 

*1 

*1 

*2 

Lower 

Bound 

Upper 

Bound 

Johns ' 

Tabled  Value 

.9 

7 

2 

4.79 

5.50 

5.17 

.9 

3 

0 

2.07 

2.27 

2.16 

.75 

6 

3 

6.00 

6.65 

6.23 

.75 

12 

3 

7.90 

8.29 

7.91 

.67 

3 

3 

5.36 

5.61 

5.33* 

.67 

15 

2 

8.71 

9.24 

8.81 

.60 

5 

2 

5.56 

5.62 

5.24* 

.  60 

7 

6 

9.24 

9.53 

9.18* 

5.  Numerical  Examples  and  Concluding  Remarks 


Examples  1  and  2  illustrate  the  method  we  have  described  in 
this  paper. 


k 

Example  1:  Let  H(x)  -  II  (n  -x  )  ,  a  •  .05,  k  »  5,  n  - 

i-1  x  1 

(20,30,40,25,60),  x  ■  (2,6,10,8,15).  Then  the  951  upper  confi¬ 
dence  limit  for  the  failure  probability  is  contained  in  (.86, .88). 


Example  2:  Let  H(x)  -  JUn^-x^),  &  -  .05,  k  ■  2,  n  -  (10,10), 
x  -  (3,2).  Then  the  951  upper  confidence  limit  for  the  failure 
probability  is  contained  in  (.70,  .73).  The  value  given  in  Lipov 
and  Riley  (1959)  is  .70. 


Remarks .  In  this  paper  we  have  showed  that  the  Lind strom-Madden 

\ 

technique  is  conservative  for  ordering  functions  satisfying  (4.2). 
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Further,  If  is  an  Integer,  then  the  Llndatron-Madden  nethod  is 
exact.  We  have  also  relaxed  the  conditions  needed  In  Vinterbotton 


(1974)  and  provided  an  alternative  to  the  nethod  of  Johns  (1976). 


The  auxiliary  results  employed  in  the  proof  of  Theorem  4,1 
are  provided  here. 

Lemma  Al:  I  (n-x,x+l),  0<y<l,  is  a  decreasing  function  of  n  and 

- -  y  “  ~ 

an  Increasing  function  of  x.  1^ (np , nq+1) ,  p+q  •  1,  0<p<l,  is  an 
increasing  function  of  q. 

Proof :  The  proof  is  immediate  from  the  observation  that  the 

beta  distribution  with  parameters  a  and  (J  has  monotone  likelihood 
ratio  in  a  and  -6  and  that  if  a  probability  distribution  has 
monotone  likelihood  ratio  In  6,  Fg(x)  Is  a  decreasing  function 
of  0  (Lehmann  (1959),  p.  68  and  p.  74). 

yi‘ki  yi+l 

Lemma  A2:  If  - - —  >  r -  a*>d  <  nj,,»  then 

-  ni-k1  -  ni+1  i  -  i  +  1 


(A.l) 


Proof :  Rewriting  the  left  and  right  hand  Bides  of  (A.l)  as 


T  -  /  ,  Li±lx  ^l-H'  .  . 

^  ^  J  I  1  "  "Z  ’  )  »  I  <  ( 

y  i+1  n i+1  1+1  ni+l 


(A. 2) 


Lemma  Al  applies  and  the  conclusion  follows. 


Lemma  A3;  Let  y ^y ^  "  y»  0<y^<l,  i*l»2 


Then 


I  v  (n-x , x+1)  >  l  b(n-k;y.,n  )1  (n-x,x-k+l)  .  (A. 3) 

/  J  /  2  *  1 - ft  •*-  Jr* 


426 


Proof  : 


k-0 


O’ 


n-k 

1 


(1-: 


T(n+1)  (1'yi 

•  r feo  kf0  "kTro 


ru+n  fyiy2  lf5 

r(n-x)  j  kf0 

0 


Thus  (A. 3)  will  hold 


T(n+1)  [yl 

r(n-x)r(x+l)  J 


r  (n+1)  fyly2 
F (n-x)  J 
0 

or 


r<n+ll-  f7 

r (n-x) r (x+i)  J 
o 
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Optimal  Upper  Confidence  Limits  for  Products  of  Poisson 
Parameters  with  Applications  to  the  Interval  Estimation  of  the 
Failure  Probability  of  Parallel  Systems 

*  ** 

Bernard  Harris  and  Andrew  P.  Soms 

Abstract 

The  problem  of  obtaining  optimal  upper  confidence  limits  for 
systems  of  independent  parallel  components  is  treated.  Exact 
optimal  upper  confidence  limits  are  obtained  for  an  arbitrary 
number  of  components  for  specified  failure,  combinations.  For 
a  small  number  of  failures,  bounds  on  the  upper  confidence  limits 
are  obtained.  For  an  arbitrary  number  of  failures  an  approxima¬ 
tion  is  given  which  is  justified  numerically  and  asymptotically. 
The  results  of  this  paper  are  compared  with  the  results  given  by 
Buehler  (1957)  and  some  numerical  examples  are  presented. 
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1.  Introduction  and  Summary 

A  problem  of  fundamental  interest  to  practitioners  in  relia¬ 
bility  is  the  statistical  estimation  of  the  reliability  of  a 
system  using  experimental  data  collected  on  subsystems.  In  this 
paper,  the  subsystem  data  available  consists  of  a  sequence  of 
Bernoulli  trials  in  which  a  "one"  is  recorded  if  the  subsystem 
functions  and  a  zero  is  recorded  if  the  subsystem  fails.  Thus 
for  each  of  the  k  subsystems  composing  the  system,  the  data  pro¬ 
vided  consists  of  the  pair  ,  i«l,2,...,k,  where  Y^  is 

binomially  distributed  (n^,p^).  We  assume  that  Yj ,Y2 , . . . ,Yk  are 
mutually  Independent  random  variables. 

The  magnitude  of  interest  in  this  problem  is  easily  evidenced 
by  the  extensive  literature  devoted  to  it.  In  this  regard,  see 
the  survey  paper  by  Harris  (1977)  and  Section  10.4  of  the  book  by 
Mann,  Schafer,  and  Singpurwalla  (1974).  In  addition,  the  Defense 
Advanced  Research  Projects  Agency  has  recently  issued  a  Handbook 
for  the  Calculation  of  Lower  Statistical  Confidence  Bounds  on 
System  Reliability  (1980) . 

Historically,  the  first  significant  work  on  this  problem  was 
produced  by  Buehler  (1957)  .  However,  Buehler's  method  as  des¬ 
cribed  in  that  paper  is  difficult  to  implement  computationally 
when  k>2. 

In  this  paper,  we  examine  the  problem  of  obtaining  upper 
confidence  limits  for  products  of  Poisson  parameters.  This 
problem  is  studied  by  means  of  majorization  methods  and  Schur- 
convexity,  such  as  described  in  the  book  by  Marshall  and  Olkin 
(1979).  A  significant  application  is  the.  determination  of  confi- 
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dence  limits  for  the  reliability  of  systems  of  k  parallel  sub¬ 
systems,  a  fundamental  problem  in  the  statistical  analysis  of 
reliability. 


2.  Exact  Solutions  for  Products  of 
Poisson  Parameters  for  Small  Failure  Combinations 

Let  X  -  (X. ,X- , . . . ,  X.  )  be  independent  Poisson  random  variables 
1  2  K  k 


with  parameters  X, , X- , . . .  , X.  ,  k  >  2,  and  let  h(X)  «  n  X.. 

1  2  K  -  i-1  1 

k 

g(x)  -  n  (x.+d)  ,  l<d<1.5  ,  x.=0,l,... 

i-1  1  1 

and  denote  the  ordered  points  in  the  range  of  g(x)  by 
j1<j2<...<j|n< -  Define 

Ai  -  |x|g(x)  -  jjJ  . 


Let 


(2.1) 


(2.2) 


Since  x^  i=l , 2 , . . .  ,k,  takes  on  non-negative  integral  values, 
we  regard  it  as  desirable  to  have  d  in  (2.1)  only  assume  non¬ 
integer  values.  This  has  the  effect  of  making  the  partition 
defined  in  (2.2)  finer  than  would  be  the  case  if  d  were  an  integer. 

It  is  easily  verified  that 


(2.3) 


a  -  sup(h(X)|  l  f  (x . ;  X)  -  al 
n  l  x^eAj’,  i<n  1  / 

is  a  (1-a)  upper  confidence  limit  for  h(X),  where 

-I  k  Xxi 

f(xjX)  -  e  1-1  n  ~T"  *  X,>0,  x.-0 ,1, . . .  .  (2.4) 

i=l  xi*  1  1 


4 


The  proof  is  identical  with  that  given  in  Harris  and  Soms  (1980). 
Note  that  if  x  is  fixed  as  n.-*»,  i»l,2,...,k,  then 


lim  q  n  n^  where 

n-.ce  i«l 
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t  K  I  K  fT\.  \  n,-X.j  X.  .  \ 

q  *  supj  n  qil  l  n  3  3  qi13  »  a) 

^i-1  1|xieAi,i<n  j«l  txij'  3  1  > 


Thus  in  practice  a  /  n  n,  may  be  employed  as  an  approximate  (1-a) 

n  i»l  1  k 

upper  confidence  limit  for  n  q.,  q4“l-p. .  In  this  sense  the 

i«l  1 

methods  of  this  paper  can  be  used  as  approximations  for  estimating 
the  reliability  of  parallel  systems  when  independent  binomially 
distributed  data  is  obtained  for  each  component. 

We  proceed  by  showing  that  g(x)  is  a  Schur-concave  function 


and  consequently 


Bx  “ 
xo 


jx|g(x)  <  g(xQ)} 


is  a  Schur-convex  set  (see  Marshall  and  Olkin  (1974) ,  pp.  1189-90 
and  Nevius,  Proschan  and  Sethuraman  (1977),  p.  264).  The  Schur- 
concavity  of  g  (x)  follows  immediately  by  noting  that 


c  -x  )  fisijl  . 

C1  x2'  [  3xx 


isiil 

3x_ 


<  o  . 


Define  F(£  ;X)  by 
o 


F(xq;X) 


l  f(xi?X)  -  P^ (B~  ) 


xic8x 


(2.5) 


and  let 


u(x  ;a)  «  sup  F(x  ?X)  ,  0<a<l  . 

°  h(X)-a  ° 


(2.6) 


Since  the  Poisson  distribution  has  a  monotone  likelihood  ratio, 
u(xQ;a)  is  a  strictly  decreasing  function  of  a  for  fixed  xD. 
Hence  for  every  c,  0<c<l,  there  is  a  unique  a(c)  such  that 


u  (x0>a  (c) )  -  c  . 


(2.7) 


Consequently,  we  also  have  that  an  (see  (2.3))  is  the  solution  in 


a  of 


u  (x  ; a)  =  a  . 
o 


(2.8) 


(2.8)  is  established  exactly  as  in  Harris  and  Soms  (1980). 

The  methodology  to  be  employed  is  as  follows.  If  F(x;X)  is 
a  Schur-concave  function  of  R^  *  -In  X^,  i“l/2, . . . ,k,  then  it 
follows  that  u(xo;a)  *  Ftx^a3^*!),  where  1  =  (1,1,...,!),  and 
then  the  solution  in  a  of  u(xo;a)  *  a  is  an  optimal  upper  confi¬ 
dence  limit  for  n  X..  This  will  entail  verifying  (for  fixed  x  ) 

i-1  1 

that 


<Rj-R2) 

(see  Marshall  and  Olkin 
following  theorem. 


3F(xo;X) 


(1974),  p. 


3F(xo;X)  1 


3  R. 


<  0 


(2.9) 


1190) .  Accordingly  we  have  the 


k 

Theorem  2.1:  Let  g(x)  -  n  (x,+d),  l<d<1.5,  k>3.  Define  6..  as 
- -  iml  x  J 

the  j -vector  all  of  whose  components  are  zeros.  Then  let 
x(1)  -  0k,  x(2)  -  d.0k_1),  x(3)  -  (2,0k„1),  x(4)  »  (l,l,0k,2), 
x<5>  ■  (3,0k_1),  x<6>  *  (4'°k-l^'  X^7)  "  (2,l,0k_2)  and 
x*s*  *  <5 , ®k— "  The  8et  Ai  defined  by  (2.2)  is  the  point  x^ 
and  the  different  permutations  of  its  components,  i*l , 2 , . . . , 8 . 
Further,  for  j*l,2 , . . . ,7,  F(x(^;X)  is  Schur-concave  in  R^, 
1*1,2,. . . ,k. 


Proof:  In  the  sense  of  the  ordering  given  by  (2.2),  obviously 
x(l)  <  x(2)  <  x(3)  K  x(5)  <  x(6)  <  x(8)#  Trivially,  d (2+d)  <  (1+d)2 

and  hence  (2+d)dk-1  ■  g(x(3))  <  (l+d)2dk"*  -  g(x(4)).  Similarly, 

since  l<d<l . 5,  (1+d)2  <  d(3+d)  and  hence  g(x*4*)  <  g(x*5*).  In 

the  same  way  g(x(6))  <  g(x(7)),  g(x(7))  <  g(x*8))»  g(x(8))  <  g(l,l,l,0h_3) 

( 8 ) 

and  g(x  )  <  g(2,2,0k_2),  establishing  the  first  part  of  the  conclusion. 
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In  order  to  establish  Schur-concavity ,  wc  must  verify  (2.9) 


Thus  consider 


F(x(A>  fX) 


where  X^  ■  e  R*.  Define 


k  -X.  x.(^/ 

-  .  I  n  e  1  X.  i/  X. 

XjSAj , j<£  i-1  ' 


<j> 


k  -Ri 


(2.10) 


(2.11) 


Letting  R  *  (R^, . . . ,9^) ,  we  obtain 


- (1)  -  ~Ri 

G (x ' jR)  -  (e  1  -  e  *)  , 


ij\  .  “Ri  X  "R. 

G(x^}  ;R)  »  (e  1  ~  e  Z)  (  £  e  x)  , 

i-1 


-R.  -R-.  k  -R,  k  "  i 

G(x(3);R)  *  (e  1  -  e  2)  (  £  e  1  +  £  £-* — )  , 

i-3  i-1  2 


G(x<4);R)  -  (e~Rl  -  e"R2)(  £  ^  +  \  , 

i-1  i< j 

-(5)  -  ”R1  'R2  £  e’2Ri  ”Ri“R2 

GU15'  jR)  -  (e  1  -  e  2)  (  £  —  +  £ — - - 

i-3  2  2 


k 

♦  z 

i<j#  u,})/n,2't 


-R . -R  ,  k  ”3Ri 

1  1  ♦  r  s-tt-)  . 


i-1 


G(i<6>,B)  -  (."*1 -.***>  ,  I 


k  "2Ri  _Rl'R2 
«_  +  • -  + 


i-3 


k 

I  ® 

i<j, <i, j)*(l,2) 


-Ri-Rj 


k  "3Ri  "2Rl”R2  ~Rl”2R2  k  "4Ri 

+  ih  “JT  Ti  +  Jj  -Ti—>  - 


and 
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-(7)  -  -Ri  -r2  e’2*1"*2  e"*1"2*2 

G  (x  7  j  R)  *  (e  X-e  2)  <- - 5 -  +  ^ -  +  l  e 


31*13 


k  '3Ri  k  _“4Ri 


+  i-3  31  +  i-1  41  +  iytj.(i,j)£(1.2)or<2v 


-2R..Rj 


1) 


-R, 


-R- 


-)• 


—R%  “R« 


ni  *»2  *>1  *»2 

Now  R^>R2  implies  e  <  e  and  thus  (R^— )  and  (e  -e  ) 

~  (i)  ' 

have  opposite  signs.  Hence  it  follows  that  F  (x  jX),  i*l,2,3, 

4,5,7  is  Schur-concave  in  R , .  The  verification  that  F(x^jX)  is 

-R2  -R. 

Schur-concave  may  be  accomplished  by  letting  k«2,  e  * ce  , 
c>0,  and  examining  the  discriminant. 

To  show  that  (2.9)  need  not  be  positive  for  all  xq,  consider 


-6R.  -6R, 

+  e 


k  *  2  and 

x  *  (7,0). 
o 

-R  -R2 

G(x  ;R)  * 
o 

(e  1  -  e  2 

-4R.-R,  -: 

e  +e 

-3R.-R-  - 

e  1  +e 

TT 


"ST 


’+e 


'+e 


-SRj-Rj  -2R1-2R2  -R1-3r2  '2Rx"R2  ~R1“2R2 


+e 


41  3 

””R1  *"R2 

and  this  is  Schur-convex  near  e  ■  e  *  4. 

Buehler  provided  an  extensive  discussion  of  this  problem  for 

the  ordering  function  determined  by  the  product  of  the  upper 

confidence  limits  for  the  individual  components.  In  particular, 

he  provided  some  numerical  tabulations  for  k»2.  Asymptotically 

Buehler' s  ordering  function  is  given  by 

9BW  "  ^  <Wxi/2>  ' 


-  l-(l-a)1/k 

the  standard  normal  cumulative  distribution  function.  It  is  easy 


where  a'  ■  l-(l-a)  '  ,  z0  satisfies  •  1-a  and  $(x)  is 


to  see  that  gfi(x)  is  Schur-concave  (see,  e.g.,  Marshall  and  Olkin 
(1974)  ,  p.  1191) . 
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3.  Bounds  on  Confidence  Limits 

In  this  section  we  employ  memorization  techniques  described 

in  Proschan  and  Sethuraman  (1977)  and  Nevius,  Proschan  and 

Sethuraman  (1977)  to  obtain  bounds  for  a  .  Throughout  this  sec- 

n 

tion  we  assume  only  that  the  ordering  function  g (x)  is  strictly 

increasing  in  each  component  and  Schur-concave  and  thus  the  set 

B-  will  be  Schur-convex  (see  the  discussion  immediately  preced- 
xo 

ing  (2.5)). 

In  order  to  proceed,  we  need  the  preliminary  results  estab¬ 
lished  below. 

1/k 

Theorem  3.1:  Let  c  and  a  be  given  with  oka  and  consider  the 

set  A (a,c)  of  vectors  X  «=  (Xj ,  X2 , . . . ,  Xk)  ,  Xi>0,  such  that 

k  k 

n  X.  «  a  and  £  X,  *  c  .  (3.1) 

i-1  1  i*l  1 

Let  ®  >  max  ^  X..  Then  there  is  a  unique  X*eA(a,c)  of 
3  XeA(a,c)  i«l  1 

the  form  X^  ■  ,  l<i<j,  Xi  "  mj'  «  jM^. 

1/k 

Proof :  The  condition  oka  1  is  a  consequence  of  the  arithmetic- 
geometric  mean  inequality  and  insures  that  A(a,c)  is  non-trivial 
for  k>3.  If  k»2,  there  is  only  one  solution  of  (3.1)  with  X1>X2» 
and  hence  the  Theorem  is  trivially  true.  Consequently,  suppose 
k>3.  Then  for  fixed  j,  (3.1)  requires  that  any  solution  of  the 
required  type  satisfy 

jMj  +  (k-j)m^  «  c  ,  m^*”3  *  a 

and  hence  setting  m^  ■  (c-jMj)/(k-j) ,  we  consider 
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(3.2) 


f ^ (M)  “  ( (c-jM)/(K-j) ,  1 <j <k-l ,  0<M<c/j  . 


Note  that  f j  (0)  -  fj(c/j)  -  0,  and 


f  j  (H)  -  (c-Mfc)  C1^-)  (£i^> 


k-j-1 


(3.3) 


Thus,  f j (M)  is  increasing  for  0<M<c/k  and  decreasing  otherwise, 
further  f j (c/k)  -  (c/k)k>a.  Hence  there  is exactly  one  solution 
Mj  of  f^(M)«a  with  Mj>c/k,  and  therefore  M^>nK. 


Now  assume  that  for  some  j,  l<j<k-l,  the  vector 
X*  «  (X*,X*, . . . ,X*)  with  ^  X*  «  Sj  is  not  of  the  form 
(Mj  , . . .  ,Mj ,  m^,...,mj).  Then  let  X^  -  S^/j  and  Xjj  *  (c-S  ^ )  /  (k- j ) 

— «  ■  ^  s  v  * 

j  .k_j. 

Define  X  ^  ~  ^ji  **  ^"lj/  *  ^ji  "  ^2  j  * 

j+l<i<k.  Since  the  geometric  mean  of  a  set  of  positive  numbers 
whose  sum  is  fixed  is  a  maximum  when  they  are  all  equal,  we  have 

k  9 

nx!.  > a.  Now  X.  is  of  the  required  form,  however,  from  (3.2)  and 
i-1  J1  k  ,  3 

(3.3),  n  X..  >a  implies  that  there  is  another  solution  of  the 
i-1  31 

required  form  with  X^  >  S ^/j ,  l<i<j,  contradicting  the  maximality 
of  S j . 

From  (2.5)  and  (2.6),  we  can  write 


u(x^;a)  -  sup  (B-  )  -  sup 


n  X.-a 
i-1  1 


c  k  k 

I  X,-c,JIX.-a 
i-1  1  i-11 


Pr(B-  )  .  (3.4) 

A  *o 


We  state  now  the  main  result  of  this  section,  using  Theorem  3.1. 

Theorem  3.2t  Let  v^  -  M,,  v,  -  iM^- ( i-1) M,  , ,  2<i<k-l, 

~  k-1 

v.  -  c  -  l  v.,  where  M<  is  specified  by  Theorem  3.1.  Then 
*  i-1  1  1 


(3.5) 


u(xo;a)  <  sup  P-(B~  )  . 

c  o 

j  * 

Proof:  Since  I  v,  ■  S..,  l<j<k-l,  J  v,  -  c,  v  majorizes  every 

i-1  1  3  i*=l  1 

k  k 

X  with  l  X.  »  c,  n  X.  -  a  {Theorem  3.1).  Then  (3.5)  follows, 
i-1  x  i-1  1 

since  if  X^  majorizes  Xj#  then  for  any  Schur-convex  set  A, 

p?  (A)  >  Pr  (A)  (Proschan  and  Sethuraman  (1977)  and  Kevius, 

A1  a2 

Proschan  and  Sethuraman  (1971)  ,  p.  264  and  pp.  267-9) . 

The  vector  v  may  be  interpreted  as  the  best  vector  that  major 

k  k 

izes  all  vectors  X  such  that  £  X.  -  c  and  II  X  -  a.  More 

*  i-1  __  i-1  1 

specifically,  there  is  no  vector  w  ?  v  such  that  v  majorizes  w  and 
w  majorizes  all  X  satisfying  the  two  conditions  given  above. 

The  following  is  a  suggested  method  for  employing  Theorem  3.2 
Find  ad  such  that 

°  ■  F(vad1/k  x>  • 


Next  calculate  the  smallest  a,  say  a  ,  such  that  sup  P- (B-  )  <  o. 

c  v  xo 

If  a„  -  a. ,  this  is  the  exact  solution.  Otherwise  a . < a  and 
m  a  a  in 

sup  P* ( B-  )  <  a  (here  a  -  a)  and  the  solution  a  satisfies 
a  v  x  m  n 

c  o 

ad-an-am*  Tlie  vector  v  may  be  calculated  by  any  of  a  variety  of 
numerical  techniques.  In  the  numerical  examples  presented  here, 
interval  bisection  was  employed. 

Example  1:  Let  k  «  5,  a  -  25,  c  «  15.  Then  the  4  vectors  X^  x2, 
Xj  and  X4  of  Theorem  3.1  are 


X1 

■ 

(9.9660, 

1.2585, 

1.2585,  1.2585,  1.2585) 

X2 

- 

(6.2004, 

6.2004, 

.8664,  .8664,  .8664) 

X3 

- 

(4.6696, 

4.6696, 

4.6696,  .4955,  .4955) 

and 

m 

(3.7172, 

3.7172, 

3.7172,  3.7172,  .1309)  , 
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from  which  v  is  determined  to  be 


v  =  (9.9660,  2.4349,  1.6079,  .8601,  .1309)  . 

Note  that  in  the  above  example  v^v^  •  •  *— v**  This  in  fact 
is  always  true,  as  the  following  theorem  establishes. 

Theorem  3.3:  For  v  defined  by  Theorem  3.2,  we  have  v^>Vj> . . , >v^. 

Proof;  It  follows  immediately  that  v^>v2,  since  M^Mj.  Consider 
therefore  v.. ,  j>2.  v^>Vj+1,  2, 3, . . . ,k-l  holds  if  and  only  if 


jMj-(j-l)M._1  >  <j+l)Mj+1-jM. 
or 

jMj  >  ((j+l)Mj+1+(j-l)M._1)/2  , 

where  Mk  ■  c/k  (satisfying  the  condition  *»  c  =  kM^  of  Theorem 
3.1) . 

Let  XA  *  (l-aj)X^_1  +  OjXj+1,  j* 2, 3, . . . ,k-l,  where 
ci j  -  (1/2)  +  (1/  (2  j ) )  and 

Xj  “  jl' X j2 ' *  *  * ,Ajk) 

and 


Xji  =  Mj  '  =  ">j  ,  j+l<i<k  . 

It  follows  that 

k 

n  X.  j  >  a  , 
i-i  Ayx 

k 

since  l  In  x,  is  a  concave  function  of  x, Now  let 
i-1  ^  1  1  .  k 


'B 


j  f  k 

-l  "  Hi  S'’1,  i"1'2 . j'  S'1 


i“ j+1 , . . . ,k.  Then 

j 

n  x 


.  XB.,i  -  ^  XA  i  '  j*l , 2 , . . . ,k-l  . 
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Thus,  using  the  properties  of  in  Theorem  3.1, 

jMj  >  ( j-l)  l  (1"aj^Mj-i+ajMj+i5  +  ♦  ajMj+i  ' 

yielding 

jM j  >  ({j+l)Mj+1+(j-l)Mj_1)/2  +  (1-a , 
which  establishes  the  theorem. 

To  illustrate  the  techniques  of  this  paper,  we  compare  numer¬ 
ical  values  obtained  by  the  above  method  with  those  given  in  the 
examples  from  Mann,  Schafer  and  Singpurwalla  (1974,  p.  505).  From 
now  on  we  assume  d  =  1.1. 

Example  2:  For  xo  *  (1,2,1)  we  obtain  afl  «  *n  «  20.56  for  a  -  .10. 
In  Mann,  Schafer  and  Singpurwalla,  an  AO  non-randomized  confidence 
bound  of  20.7  is  obtained. 

Example  3:  Let  xq  *  (2,3,5),  a  -  .10.  Then  we  obtain  ad  *  135.46. 
A  summary  of.  computer  calculations  which  establishes  135.46  <  a^  < 
142.46  is  given  below  in  1  .dale  1.  With  the  exception  of  the 
likelihood-ratio  value  of  133  and  the  AO  non-randomized  confidence 
bound  of  129,  all  the  other  confidence  bounds  given  in  Mann, 

Schafer  and  Singpurwalla  exceed  the  upper  bound  of  142.46.  For 
k®3  it  is  possible  to  do  a  direct  computer  tabulation  of  u(xQ;a). 

This  gives  an  »  135.46,  the  diagonal  value.  (See  Table  1  on  the 
following  page.) 

The  two  examples  below  are  for  four  and  five  component  systems 
for  which  there  are  no  comparable  numerical  examples  available. 

Example  4s  Let  x_  -  (2, 2, 2. 2)  and  a  -'.lO.  Then  a.  -  a  -  150.63. 

o  an 
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1.  Summary  of  Calculations  Used  to 
Obtain  the  Upper  Bound  for  an  in  Example  3 


a 

V 

sup  P~ <B- 

c  v  xo 

135.46 

13.0680 

4.7283 

1.7108 

.1101 

136.46 

13.0867 

4.7409 

1.7173 

.1086 

137.46 

13.1053 

4.7532 

1.7240 

.1071 

138.46 

13.1239 

4.7656 

1.7305 

.1057 

139.46 

13.1423 

4.7780 

1.7370 

.1042 

140.46 

13.1607 

4.7902 

1.7435 

.1028 

141.46 

13.1789 

4.8024 

1.7500 

.1014 

142.46 

13.3299 

4.8057 

1.7325 

.9999 
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Example  5:  Let  xQ  ■  (2,2, 2, 2,2)  and  u  =  .10.  Then  a^  «  429.69. 

A  summary  of  the  computer  calculations  which  establish  429.69  < 
an  <  435.69  is  provided  in  Table  2.  (See  the  following  page  for  this 
table.) 

1/k 

As  a.  increases,  the  difference  between  a.  and  a  becomes  wider 
a  an 

Thus  the  techniques  of  Section  3  are  more  useful  for  small  x  ,or  equiva- 

1/k 

lently,  small  a^  .  For  example,  for  xQ  *  (5,5,5),  a^  »  387.18, 

and  it  is  not  practical  to  compute  affl  because  it  is  much  bigger 

than  a..  However,  direct  tabulation  of  u(x  ;a)  reveals  once  more 
a  o 

that  a^  **  u(xQ;a).  A  justification  of  why  a^  =  u(xQ;a)  for  large 

l/k 

a^  is  given  in  the  Appendix.  This,  together  with  the  results 
of  Section  2,  suggests  very  strongly  that  for  all  practical 
purposes  a^  -  an« 

Remarks;  Note  that  Tables  1  and  2  are  virtually  linear  in  their 
behavior  in  the  neighborhood  of  the  solution.  This  suggests  that 
solutions  are  obtainable  by  interpolation  and  then  one  should 
subject  them  to  verification. 

The  calculations  described  above  utilized  two  short  FORTRAN 
programs  for  2-10  components.  Listings  are  obtainable  from  the 
authors. 


4.  Comparisons  with  Buehler ' s  Tables 

In  order  to  provide  an  illustration  of  the  performance  of 
k 

g(x  )  -  n  <x,+d),  l<d<1.5,  when  compared  with  the  tables  given 
i-1 

by  Buehler  (1957),  we  chose  d-1.1,  k-2.  For  k«2,  the  values  of 
®n  and  coincided  for  both  the  ordering  based  on  g(x)  and 
Buehler' s  ordering  and  further  were  for  all  practical  purposes 


2.  Summary  of  Calculations  Used  to 
Obtain  the  Upper  Bound  for  a^  In  Example  5 


a 


429.69 

430.69 

431.69 

432.69 

433.69 

434.69 

435.69 


*“p  pv(B;  1 
c  o 

.1016 

.1013 

.1010 

.1007 

.1004 

.1001 

.0998 
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equal  for  the  two  different  orderings. 

In  Table  3  we  give  Buehler's  upper  confidence  limit,  Buehler's 
diagonal  value  and  the  exact  upper  confidence  limit  and  diagonal 
value  corresponding  to  g,  denoting  them  by  ®nB»  a^B,  ang  an^  adg* 
respectively.  These  values  are  provided  for  all  failure  combina- 
tions  from  (0,0)  to  (5,5)  for  a*.l. 

See  next  page  for  Table  3 

An  examination  of  Table  3  shows  that  differences  between  the 
four  alternatives  presented  are  small  for  the  specific  example 
(k*2,  a».l). 


5.  Concluding  Remarks 

In  this  paper  a  procedure  for  obtaining  bounds  on  an  optimal 
upper  confidence  limit  for  the  failure  probability  of  a  parallel 
system  is  given.  The  procedure  employs  the  theory  of  major- 
ization  and  is  valid  for  an  arbitrary  number  of  components  and 
gives  the  exact  answer  or  narrow  bounds  when  the  observed  number 
of  failures  is  small  for  each  component.  In  addition,  numerical 
and  asymptotic  justification  is  given  for  using  a^  as  an 
approximation  to  an«  Tables  of  a^  are  in  preparation  for  moderate 
numbers  of  failures  for  3,  4  and  5  components  and  will  be  avail¬ 
able  in  the  near  future. 
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and  a  < 


Appendix 


Theorem  A1  •  Let  <  1  ^  i  £  k,  be  independent  identically 

distributed  normal  random  variables  with  means  X  and  variances 
Let  X2i,  1  <  i  <  k,  be  independent  normally  distributed  random 
variables  with  means  and  variances  t^,  where  ••  X  +  0(XC) 


c  <  l,as  x-*-*,  and  n  t.  *  X  .  Let  6  be  given#  0  <  B  <  1, 


i*l 


let  a  be  a  specified  positive  real  number,  let  Z  e  n  (x  ♦  a) 
K  3 


H  (X2j  +  a)  and  let  d(X)  satisfy 


j-1 


P[Z1  <  d  (X)  ]  -  0, 


(A. 


Then  as  X  , 


6  -  P[Z2  <  d  (X )  ] 


f  0((ln  X)1,5X'1]  ,  c  £  0 # 


l  on*-1] 


(A. 


,0  <  c  <  1. 


Proof:  Throughout,  let  <J>  and  •  denote  the  density  and 

distribution  function  of  the  standard  normal.  Clearly, 

Plzl  1  -  PU2  <  d{X)]  «  /  /  •••  /  (f,(x)  -  f  (x))dx, 

-C#  -GO  1 

X 

lx:  n  (x.  +  a)  <  d  (X )  } 

3-1  ^ 

where  x  ■  (x^,x2  # . . .  #Xj{)  #  f^  is  the  probability  density  function 
of  xn*x12/ • • • 'xlk  f2  is  that  of  X21,X22 , . . . ,x2k«  Now 


I* 


>  -ft,  j  -  1,2 , . , , ,k]  1  (1  -  -j—sy  <'<^T72)) 


(A. 4) 


P(Xj.  >  -a#  j  “  1,2,  ...,k]  >_  II  I  1  ~  ~ 


T  V*  ,A.*. 


(A. 5) 


F 

Consequently,  for  X  sufficiently  large,  there  exists  a  constant 


0  such  that 


PlXi,  >  -a,  j  -  1,2, ... ,k]  >  1  -  e™  ,  i  -  1,2,  (*.6> 


Then,  for  i  «  1,2, 


P ( Z±  <  d(X) ]  »  PCZi  <  d(X),Xij  >  -a,  j  -  1,2, .. . ,k] 


.  +  Piz.  <  d (X )  ,  U  (X.  .  <  -a)]  , 
1  j-1  i3 


■  - y  '  !  t,'*  uV.':,;v 


.and  therefore 


PlZi  <  d(X)]  -  P[Z±  <  d (X) ,Xi j  >  -a,  j  *  1,2, .. . ,k]  <  e" 


(A.  7) 


Next,  -we  calculate 


PlZ1  ^  d  (X)  ^  —  a ,  j  “  1,2,  .. .  ,k)  —  P  [  Zj  £  d(X),X2j  21  ”*a ,  j 


1,2, 


•  PfZ^  <  d  (X)  ,X^  j  2.  ~  a ,  j  ■  l,2,...,k|x^j  *  x  j »  j  *  2,3, ...,k] 


.  P  X  <  d(X)  -  a 
p  XU  -  k  * 

n  (x.+a) 


-  ,  ...  X  /XV2 


n  (x.+a) 
j-2  3 


♦  (b- )  . 
X 


(A.  8) 


Therefore 


PIZ,  <  d(X),X..  >  -a,  j  *  1,2,. ..*10 

1  -  13  (A. 9) 

-  f  f  •**/  *(b-)g1(x2»x3,...,xk)dx2dx3. ..dxk  , 

-a  -a  -a  X 

where  gx <x2,x3, . . . ,xR)  is  the  probability  density  function  of 
X12'X13'***'X1K*  FrOTn  <A*6)f  We  haVe  that 

0  BD  ® 

/  j  4>  (b-) g^tXjfXj , . . .  #xk)dx2dx3. .  .dxk 

“®  —  —  X  (A.  10) 

•  »  •  .  ,  -mX 

-  J  /  •••/  ♦(b~)g1(x2/x3, . . . ,xk)dx2dx3. . .dxk  <  e 

-a  -a  -a  X 

Hence  we  will  estimate  the  first  expression  on  the  left  hand  side  of 
(A. 10).  Similarly ,  for  Z2  we  will  consider 

g2  (x2  ,x3, . . .  #xk)dx2«3x3. .  «dxk 

(A. 11) 


J  /  •••/ 


d  (X) 


n  (x,+t,) 
Pj.2  3  3 


-  a  -  T,1  /t 


1/2 

1 


where  g-.Uj.Xj . *k>  i»  the  probability  density  function  of 

x22,X23,...,x2k.  In  the  first  integral  in  (A. 10),  let 

1/2 

(yi  -  X)/X1/2  .  ut  and  in  (A.ll)  let  (yi  -  -  ui( 

i  *  2,3,...,k#  obtaining 
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Thus 


(X1/2  -  kj (X) )  n  (1  +  X^x’1/2  +  aX-1)  1  -  X1/2  -  ax 


-1/2 


k  •  / 

,  d  j-2  3 


-  ‘ J2«i 


-1/2, 


k  2. , -1/2 


+  (  I  I  x  x  )X 
2<i<  j  J 


"1/2  +  0( (in  X)1*5!"1) 


c-1, 


Similarly,  using  t.  <=  X  /  n  t  .  ,  Tj/X  c  1  +  0(X 

*  j*2  J 

Uj/M1/2  -  1  +  DU0’1),  j  -  1.2 . *•  |xil  <  H,  we  heve 


d(X) 


-  a  -  t. 


n  (r!/2x,+T.+a) 
j«2 


A 


1/2 


j  3  D 
1/2 


(X1/2-kd(X)>n  (l+Xjt'  /  +axT  )■  -X 
j*2 


2» , -1/2 


(t  ^/x^) 

=  -  I  x^Ul-kaX'^+^aH  |  *i>l‘1/2+<  Jj*iU 

i»2 

+  (  l  l  x  ,  x  . )  X”1/2  +  0(XC-1)+0( 

2<i< j  3 


(A.  14) 


-at 


-1/2 


(A. 15) 


c-!v (in  X)1*5X"1) 


Combining  (A, 14)  and  (A. 15)  with  (A. 7),  (A. 9),  (A. 10)  and  (A. 11) 
establishes  the  theorem. 

For  c  <  \  standard  weak  convergence  arguments  show  that 

iim (6  -  PlZ2  <  d<x)J>  “  °* 

In  this  case  Theorem  A1  provides  additional  information  by  specifying 
the  rate  of  convergence. 
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By  standardizing  the  first  expression  in  (A. 13)  and  applying 
the  dominated  convergence  theorem  the  following  result  can  be 
obtained. 

Theorem  A2 :  Let  1  <  i  <  k,  be  independent  identically 

distributed  normal  random  variables  with  means  X  and  variances  X. 
Let  X2i'  1  5.  i  be  independent  normally  distributed  random 

variables  with  means  and  variances  where  i ^  *  X  +  0(XC), 

c  <  1  and  let  Z1,Z2  an<*  d(X)  -be  specified  as  in  Theorem  Al. 

Then 

Um(B  -  PIZ,  <  d (X ))  -  0. 

X-~» 
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reliability  based  safety  factors 

FOR  CONCRETE  STRUCTURES  SLIDING  STABILITY 

Paul  F.  Mlakar 
Structures  Laboratory 

U.  S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  MS 

SYNOPSIS.  A  safety  factor  criteria  for  sliding  stability,  which  explic¬ 
itly  quantifies  load  and  resistance  uncertainties,  is  developed  through  the 
probabilistic  analysis  of  a  simplified  illustrative  problem.  Certain  points 
of  this  analytical  development  are  noted  to  warrant  further  study. 

I .  INTRODUCTION .  A  concrete  gravity  structure,  such  as  a  dam,  will 
fail  by  sliding  along  a  critical  foundation  surface  when  the  resultant  effect 
of  externally  applied  forces  exceeds  the  total  resistance  developed  along  this 
surface.  (US  Army  Corps  of  Engineers,  1958).  (US  Bureau  of  Reclamation, 
1976).  A  practical  means  of  assuring  the  safety  of  such  structures  is  to 
require  that  the  ratio  of  available  resistance  to  the  effect  of  the  applied 
load,  termed  the  factor  of  safety,  exceeds  a  minimum  value  greater  than  unity. 
Heretofore,  this  value  has  been  established  somewhat  subjectively  on  the  basis 
of  engineering  judgement.  The  implicit  purpose  of  this  factor  is  to  account 
for  uncertain  definition  of  the  loading  and  resistance.  Accordingly,  proba¬ 
bility  theory  could  be  used  to  supplement  engineering  judgement  in  objectively 
establishing  a  minimum  safety  factor.  Such  a  procedure  is  detailed  herein 
for  a  simplified  example  problem  to  facilitate  a  discussion  at  this  clinical 
session.  This  development  will  indicate  items  requiring  further  research 
before  a  reliability  based  sliding  safety  factor  can  be  practically 
implemented. 

II.  MECHANISM  OF  SLIDING  FAILURE.  Figure  1  is  a  free  body  diagram  of 
a  concrete  gravity  dam  cross  section  under  its  normal  operating  condition. 
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In  this  diagram,  H  represents  the  resultant  of  the  hydrostatic  pressure 
exerted  on  the  upstream  face  of  the  cross  section  by  the  reservoir  when 
filled  to  the  spillway  elevation  and  W  is  the  total  weight  of  the  daa. 

For  illustrative  purposes  only,  the  critical  foundation  surface  is  shown  to  be 
oriented  horizontally  and  the  total  shearing  and  normal  forces  acting  on  this 
plane  are  designated  by  Q  and  N  respectively.  If  this  structure  is  stable 
against  a  sliding  failure,  the  condl  $  of  translational  equilibrium  require 
that 

Q  -  H  (1) 

and 

N  “  W  (2) 

Now,  the  maximum  total  shearing  resistance  R  which  can  be  developed 
along  the  foundation  surface  is  approximated  by  (Lambe  and  Whitman,  1969) 

R  -  C  +  N*tan$  (3) 

In  this  expression,  C  represents  the  total  cohesive  resistance  which  can  be 
developed  in  the  absence  of  any  normal  force  and  $  is  termed  the  friction 
angle  of  the  foundation  material.  Figure  2  Indicates  that  the  shearing  re¬ 
sistance  available  from  equation  (3)  to  counteract  externally  applied  loads 
is  a  function  of  the  foundation  material  properties,  C  and  $  ,  and  the 
normal  force  N  induced  by  these  loads  through  equation  (2). 

The  safety  factor  is  usually  taken  to  be  ratio  of  available  resistance 
to  load  effect  or  in  this  illustration 

F  *  l  <*> 

Accordingly,  the  structure  Js  stable  against  sliding  for  values  of  F  not 
less  than  unity.  As  neither  Q  nor  R  can  be  known  with  certainty,  prudent 
practice  has  traditlc  -*ily  required  that  the  calculated  F  exceed  unity  by  a 
comfortable  margin.  The  magnitude  of  this  margin  has  traditionally  been 
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determined  through  a  professional  consideration  of  the  uncertainty  in  load  and 
resistance  definition  as  well  as  the  consequences  of  structural  instability  in 
specific  situations. 

III.  PROBABILISTIC  ANALYSIS.  Probability  theory  provides  a  means  of 
rationally  quantifying  the  various  uncertainties  associated  with  sliding 
stability  as  follows.  First,  for  computational  convenience  in  this  illustra¬ 
tive  problem,  assume  that  the  load  is  described  by  the  value  of  a  lognormal 
random  variable  having  a  median  m ^  and  a  coefficient  of  variation  . 
Similarly,  model  the  resistance  by  the  value  of  a  lognormally  distributed 

random  variable  with  median  m  and  coefficient  of  variation  V  .  If  load 

K  R 

and  resistance  are  further  presumed  to  be  Independent  of  one  another,  it  then 
follows  that  the  safety  factor  is  lognormally  distributed  with  median 


and  coefficient  of  variation  V_ 

F 


satisfying 


ln(Vp+l)  -  ln(V^+l)  +  ln(Vp+l) 


(5) 


(6) 


A  further  consequence  of  these  conditions  is  that  the  reliability  against 
sliding  failure  is 


where  4>(<)  is  the  cumulative  distribution  function  of  the  standard  unit 
normal  variate.  Theoretically,  one  could  use  this  distribution  to  compute 
the  probability  of  sliding  instability  (Prendergast ,  1979).  However,  practical 
olfficulties  exist  with  this  concept  because  there  is  insufficient  Justifica¬ 
tion  for  the  assumed  lognormal  distribution  of  load  and  resistance,  because 
an  explicit  criteria  for  an  acceptable  failure  probability  is  difficult  to 
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establish  and  because  many  practicing  civil  engineers  are  unschooled  in 
probabilistic  calculations. 


IV.  RELIABILITY  BASED  SAFETY  FACTOR.  Similar  difficulties  in  the 

structural  design  of  buildings  have  recently  been  circumvented  by  establishing 

safety  factors  based  on  probabilistic  computations  which  can  then  be  used  in 

the  traditional  deterministic  manner  (Ellingwood  et  al.  1980).  A  corresponding 

approach  in  the  case  of  sliding  stability  would  proceed  by  estimating  the  level 

of  reliability  implied  by  current  design  practice.  A  recent  examination 

(Baecher  et  al,  1980)  of  modem  United  States  dams  of  all  types  disclosed  a 

-4 

failure  rate  from  all  failure  mechanisms  of  2  x  10  per  dam-year  of  which 

approximately  102  were  attributed  to  slides.  A  preliminary  estimate  for  the 

sliding  reliability  implied  by  current  design  practice  for  the  normal  operat- 

-4 

ing  condition  might  then  be  r  ■  1  -  0.10  x  2  x  10  »  0.99998.  Equations  (5) 

and  (7)  and  the  tabled  values  of  the  normal  distribution  then  imply  that  an 
appropriate  median  safety  factor  is  given  by 


ln(vj+l)  +  ln(V*+l) 


(8) 


In  Figure  3,  it  is  seen  that  this  criteria  desirably  requires  a  higher  safety 
factor  as  the  uncertainty  about  load  or  resistance  increases.  Surely,  the 
appropriateness  of  the  lognormal  model  assumed  in  the  foregoing  warrants 
further  investigation.  Data  in  addition  to  (Baecher  et  al_,  1980)  should  also 
be  examined  to  refine  the  reliability  level  implicit  in  equation  (8)  before 
implementing  this  procedure.  The  result  of  these  studies  would  be  a  relation 
among  nip  ,  VR  and  that  more  realistically  described  the  sliding  stability 
of  a  concrete  structure. 


V.  APPLICATION.  The  above  criteria  would  be  applied  in  the  following 
Btep-by-sitp  procedure: 
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1.  Estimate  the  load  effect's  coefficient  of  variation  Vq  .  This 
estimate  should  encompass  not  only  the  uncertainty  of  the  variables 
from  which  the  load  is  computed  but  also  the  accuracy  of  the  analytic 
cal  model  used  in  the  computation. 

2.  Similarly  estimate  considering  not  only  the  sample  variabi¬ 

lity  of  test  data  but  also  any  differences  between  measured  and 
in  situ  values  as  well  as  the  accuracy  of  the  resistance  model 
adopted  through  equation  (3). 

3.  Enter  Figure  3  to  obtain  the  median  safety  factor  required  con¬ 
sistent  with  the  load  and  resistance  uncertainties  determined  in 
steps  1  and  2. 

4.  If  a  structure's  safety  factor,  computed  from  median  values,  equals 
or  exceeds  the  required  value,  the  design  is  acceptable  as  in 
traditional  deterministic  practice. 

5.  If  a  structure's  computed  median  safety  factor  is  less  than  the 
required  value,  the  design  has  not  been  shown  to  be  acceptable. 

It  can  now  be  modified  to  an  acceptable  level  as  in  traditional 
practice.  Alternately,  further  design  studies  can  be  conducted 
in  hope  of  reducing  or  and  thereby  requiring  a  lower 

trip  which  the  original  design  may  satisfy. 

Notice  that  this  procedure  allows  the  engineer  to  quantify  his  uncertainty 
about  load  and  resistance  rather  than  relying  on  only  nominal  deterministic 
values  for  these  variables.  However,  some  guidance  on  this  quantification 
must  be  developed  for  practical  use. 

For  example,  suppose  that  the  normal  operating  load  on  a  dam  is  charac¬ 
terized  by  m^  -  300  kip  and  a  relatively  small  »  0.05  since  this  load 
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effect  results  from  a  veil  understood  hydrostatic  pressure.  From  the  dam's 
weight  and  preliminary  estimates  of  foundation  material  properties  the  resis¬ 
tance  is  thought  to  be  described  by  mR  ■  550  kip  and  VR  »  0.50  .  The 
corresponding  nip  ■  ■  1.8  which  is  less  than  the  value  of  2.0  required  in 

Figure  3  to  be  consistent  with  the  VR  ■  0.50  and  Vp  ■  0.05. 

Now  suppose  further  foundation  investigations  are  performed  to  qualify 
this  design  which  revise  the  estimated  resistance  parameters  to  be  ni^  ■ 

500  kip  and  VR  *  0.40  .  The  updated  median  safety  factor  becomes  nip  *  ■ 

1.7  which  exceeds  the  value  of  1.6  now  required  in  Figure  3  to  be  consistent 
with  the  improved  estimate  of  VR  . 

VI.  CONCLUSION.  A  reliability  based  safety  factor  for  a  simplified 
example  of  sliding  stability  has  been  developed  which  quantifies  the  engineer's 
uncertain  knowledge  of  load  and  resistance.  Simplifying  assumptions  in  both 
the  deterministic  and  probabilistic  analyses  leading  to  this  criteria  should 

be  critically  examined.  Statistical  data  are  also  needed  to  refine  the  calibra¬ 
tion  of  this  procedure  with  current  deterministic  practice. 
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Figure  1.  Concrete  Gravity  Dam  Under  Normal 
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ABSTRACT.  The  analysis  of  genetic  testing  in  paternity  disputes  uses 
several  probabilities.  Trie  commonly  used  probabilities,  first  developed  in 
Europe,  are  discussed.  Two  additional  probabilities,  based  on  (1)  the 
possible  fathers  of  the  child  when  maternity  is  assumed  and  (2)  the  possible 
children  of  the  mother  and  putative  father,  are  introduced  to  measure  the 
putative  father's  and  mother's  risks.  A  sample  calculation  is  made  to 
clarify  the  different  probabilities.  The  results  are  organized  in  an  analysis 
table  to  present  an  unbiased  and  conqD rehens ive  summary.  A  hypothesis  test  is 
proposed  for  decision  making.  This  hypothesis  test  is  designed  for  the  United 
States  legal  system  as  opposed  to  the  European  system.  A  brief  discussion  of 
the  attitudes  and  interactions  of  analysts  and  courts  is  included. 

I .  INTRODUCT ION .  Statisticians  and  people  in  other  professions  who 
deal  with  probabilities  are  often  asked  to  interpret  data  from  experiments 
whose  design  cannot  be  controlled.  When  this  data  can  be  interpreted  with 
several  techniques,  great  care  must  be  taken  to  present  an  unbiased  and 
conplete  analysis.  When  the  analysis  is  to  be  used  by  people  not  trained  in 
statistics,  a  logical  and  uncomplicated  presentation  is  extremely  important. 

The  topic  of  paternity  testing  is  a  challenge  to  statisticians,  judges, 
and  juries.  It  has  great  social  and  economic  impact  for  the  mother,  child, 
putative  father,  and  all  tax  payers  who  contribute  to  state  funded  child 
support.  The  intense  personal  involvement  creates  biased  viewpoints.  The 
antagonistic  legal  system,  as  it  is  practiced  in  the  United  States,  makes  the 
presentation  of  an  unbiased  analysis  difficult.  However,  the  people  who  must 
decide  paternity  disputes  need  all  the  infomation  they  can  obtain  in  an 
unbiased  and  unconfusing  manner. 

Courts  in  the  United  States  have  been  hesitant  to  assign  much  weight  to 
statistical  analyses  which  have  been  developed  and  used  in  Europe.  In  the 
United  States,  there  may  well  be  two  primary  reasons  that  the  use  of 
probabilities  in  paternity  disputes  has  been  unpopular  and  infrequent.  First, 
the  United  States  legal  system  starts  with  the  assumpution  that  the  putative 
father  is  not  the  true  father;  but  the  European  system  begins  with  the  opposite 
hypothesis.  Second,  most  proposed  analyses  have  been  so  inconplete  and 
fragmented  that  opposing  lawyers  could  easily  stress  biased  viewpoints  and 
create  confusion. * 
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II,  MEASUREMENTS ■  There  are  several  genetic  polymorphisms  which  yield 
information  about  paternity.  The  first  measurements  were  made  on  blood  groups 
such  as  the  ABO  system.  Now  there  are  20  to  25  genetic  marker  systems 
including  blood  groups,  protein  groups,  enzyme  groups,  and  HLA  types. 

All  available  systems4  fit  into  the  model  of  the  hypothetical  systems  of 
Appendices  A,  B,  and  C.  Any  tested  person  is  classified  according  to  phenotype. 
This  phenotype  includes  one  or  more  genotypes .  Each  genotype  contains  two  genes 
which  may  be  identical.  The  laboratory  testing  on  any  person  cannot  measure  the 
genotype  unless  the  measured  phenotype  has  only  one  genotype.  The  phenotype, 
gene,  and  genotype  frequencies  are  the  fractions  of  the  relevant  population  that 
have  those  phenotypes,  genes,  and  genotypes.  These  three  types  of  frequencies 
each  naturally  sum  to  unity.  The  numerical  values  are  independent  of  sex  but 
depend  on  the  racial  subgroup  and  geographical  area  of  the  population."  Since 
there  are  three  adults  that  must  be  considered  in  a  paternity  dispute,  three 
populations  must  be  characterized  by  gene  frequencies.  Although  these  three 
may  be  identical,  this  paper  will  consider  the  general  case;  for  the  first 
hypothetical  genetic  system,  Appendix  A  describes  the  random  man  and  Appendix  B 
describes  the  populations  from  which  the  mother  and  putative  father  are  taken. 
Appendix  C  describes  all  three  populations  for  the  second  hypothetical  genetic 
system.  Although  both  hypothetical  systems  have  obviously  been  given  exact 
frequencies  for  all  three  populations,  actual  measurement  of  gene  frequencies 
uses  sampling  and  the  desired  gene  frequencies  do  not  come  from  an  exact 
algebraic  solution.  One  approach  is  to  use  the  least  squares  technique  as 
suggested  in  Appendix  A. 

In  addition  to  gene  frequencies  describing  the  populations  of  the  random 
man,  mother,  and  putative  father,  the  exact  phenotypes  of  the  mother,  child, 
and  putative  father  are  measured.  It  is  assumed  that  no  errors  are  made  by 
the  medical  laboratory  which  types  the  mother,  child,  and  putative  father. 

The  child's  and  mother's  genetic  systems  are  further  measured  by  their 
relationship.  This  restriction  of  the  laboratory  measured  types  occurs 
because  the  child  gets  one  gene  of  each  genotype  from  the  mother  and  one  from 
the  true  father.  For  example,  if  the  mother  is  phenotype  R  and  the  child  is 
phenotype  Q,  the  mother  must  be  genotype  ab  because  she  had  to  give  gene  a 
to  the  child.  .Also,  if  the  mother  is  phenotype  Q  and  the  child  is  phenotype  R, 
the  child  must  be  genotype  ab  because  the  mother  could  not  have  given  gene  b 
to  the  child.  Occasionally  both  the  mother's  and  child's  genotypes  are 
restricted;  for  example,  an  R  mother  and  S  child  must  be  ab  and  ac,  respectively. 
This  assumption  of  true  maternity  between  the  mother  and  child  is  normally  valid 
when  the  mother  is  attempting  to  establish  paternity  of  the  putative  father; 
and  it  will  be  made  for  all  calculations  in  this  paper.  However,  it  must  be 
considered  very  suspect  in  cases  such  as  a  man  and  wife  attenpting  to  establish 
paternity  of  children  for  purposes  of  immigration. 

III.  EXCLUSIONS.  One  possible  result  of  genetic  testing  is  to  exclude 
^  putative  father  from  any  possibility  of  paternity.*  An  example  of  an 
unquestionable  exclusion  in  the  hypothetical  genetic  system  of  Appendix  C  occurs 
when  the  child  is  phenotype  JK  but  the  mother  and  putative  father  are  both 


phenotype  JJ;  the  assumption  of  maternity  requires  that  the  child's  J  gene 
was  transmitted  by  the  mother  but  the  child  also  has  a  K  gene  which  the 
putative  father  does  not  possess  and  could  not  transmit.  Medical  authorities 
are  justly  concerned  with  the  possibility  of  false  exclusions.*  A  few  examples 
are  given  here  just  to  underscore  the  fact  that  any  statistical  analysis  relies 
on  a  medical  foundation.  An  apparent  exclusion  is  obtained  if  the  mother  and 
child  are  JJ  while  the  putative  father  is  KK  in  the  hypothetical  genetic  system 
of  Appendix  C.  This  exclusion  could  be  false,  however,  if  (1)  a  rare  L  gene 
existed  in  the  system,  (2)  the  antiserum  necessary  to  detect  the  L  gene  is 
unavailable,  and  (3)  the  child  is  really  JL  and  the  putative  father  is  really 
KL.  A  second  type  of  false  exclusion  occurs  if  the  child  is  not  old  enough 
for  the  genetic  system  to  be  fully  developed.  An  example  using  the  hypothetical 
system  of  Appendix  A  would  occur  if  both  the  mother  and  putative  father  were 
U  *  be  and  the  child  tested  as  V  *  bd  but  would  become  U  *  be  when  his  or  her 

genetic  system  fully  developed.  For  this  paper,  the  genetic  systems  will  be 

assumed  to  be  completely  developed,  known,  and  tested. 

If  no  exclusion  is  obtained,  two  probabilities  may  be  calculated  as  a 
rough  indication  of  paternity.6  These  are: 

V  =  P  (a  random  man  could  be  the  father)  and 

Z  =  P(a  random  man  tests  as  if  he  could  be  the  father). 

An  alternate  way  of  viewing  the  same  information  is  to  consider: 

P'x  =  P(a  random  man  could  not  be  the  father)  and 

P£X  =  P(a  random  man  would  be  excluded  by  testing). 

Appendix  D  shows  an  example  of  these  calculations  based  on  gene  frequencies  of 
Appendices  A  and  C.  1'  will  always  be  less  than  or  equal  to  Z.  Neither  1"  or 
Z  depend  on  the  result  of  putative  father's  genetic  test  although  the  calculation 
loses  its  significance  if  the  putative  father  is  excluded.  For  both  V  and  Z, 
the  only  gene  frequencies  considered  are  those  of  a  random  man;  the  information 
about  the  mother's  or  putative  father's  racial  and/or  geographic  backgrounds  are 
not  used.  The  combination  of  Z'  or  Z  values  from  separate  sys  terns  assumes  that 
these  systems  are  independent. 

The  discrimination  of  genetic  systems  may  be  measured  by  average  values 
of  the  probabilities  of  excluding  falsely  accused  putative  fathers.  This 
calculation,  which  is  illustrated  in  Appendix  E,  utilizes  the  probabilities 
of  occurrence  of  random  mother-child  combinations ,  the  gene  frequencies  of  the 
mother's  population,  and  the  gene  frequencies  of  a  random  man. 

IV.  CONDITIONAL  PROBABILITIES.  The  existence  of  the  putative  father's 
phenotype  may  be  used  in  the  calculation  of  conditional  probabilities  which  give 
a  crude  likelihood  of  paternity.7  These  are  defined  as: 
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Y'  =  P(the  putative  father's  phenotype  occurs  in  random  men)  and 

X'  =  P(the  putative  father's  phenotype  occurs  in  possible  fathers 
having  the  putative  father’s  gene  frequencies). 

Appendix  F  shows  this  calculation  for  the  case  started  in  Appendix  D  using  the 
genetic  systems  of  Appendices  A,  B,  and  C. 

Conditional  probabilities  containing  more  information  may  be  calculated 
on  the  basis  of  gene  transmission.*  These  probabilities,  which  also  use  the 
putative  father's  phenotype  and  the  gene  frequencies  of  his  population,  are 
defined  as: 


Y  =  P(the  necessary  genes  to  produce  the  child  were  transmitted)  and 

X  =  P(the  necessary  genes  to  produce  the  child  were  transmitted 
to  the  mother  if  the  putative  father  was  the  father). 

Appendix  G  shows  this  calculation  for  the  example  considered  in  Appendices  D  and 
F.  The  calculation  of  gene  transmissions  is  illustrated  in  the  demonstration 
that  the  probability  of  a  random  man  transmitting  a  gene  is  just  the  gene 
frequency  of  a  random  man.  The  consideration  of  the  genetically  possible  ways 
that  the  mother  and  a  random  man  or  putative  father  can  transmit  genes  makes  Y 
and  X  more  informative  than  Y'  and  X'. 

The  paternity  index  is  sometimes  defined9  as  XV  Y"  and  sometimes1  as  X/Y. 
As  the  exanple  of  Appendices  F  and  G  show,  these  are  not  the  sane.  This 
illustrates  that  users  of  probabilities  in  paternity  disputes  must  be  very 
careful  with  their  definitions. 

The  combination  of  X^'s  and  Y^'s  from  different  genetic  systems  into 

a  composite  X  and  Y  requires  independence  of  the  systems  in  order  for  the 
multiplication  procedure, 

X  -  X1X2  ' ' '  Xfinal  system 

Y  ’  Y1Y2  •  • '  Yfinal  system  - 

to  be  valid.  When  systems  are  not  independent,  they  must  be  combined  into 
a  single  system  as  indicated  in  Appendix  H. 

V.  BAYES  FORMULA.  The  likelihood  of  paternity  for  the  putative  father 
is  given  by 

W  =  P(the  putative  father  is  the  father  given  that  the  necessary 
set  of  genes  have  been  transmitted  to  the  mother  to  produce 
the  child). 
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Since  the  child  exists  and  accurate  genetic  testing  is  assumed,  W  reduces  the 
probability  that  the  putative  father  is  the  father.  Examination  of  X  and  Y 
shows  that  W,  X,  and  Y  are  related  by  Bayes  Formula.1  By  using  P  and  I-P  to 
denote  a  priori  probabilities  of  paternity  for  the  putative  father  and  random 
man,  W  may  be  written  as 


The  a  priori  probability  P  is  an  integral  part  of  the  calculation  of  W. 

As  seen  in  Appendix  I ,  which  continues  the  example  started  in  Appendix  D,  the 
selection  of  a  number  for  P  influences  the  numerical  value  of  W.  In  some 
paternity  calculations,  P  is  set  at  one-half J ;  the  argument  for  this  substitution 
is  that  nothing  is  really  known  other  than  the  genetic  test  results.  In  other 
analyses,  some  values  such  as  seven- tenths  is  used  on  the  assumption  that  this 
is  the  proportion  of  valid  paternity  claims  which  women  bring  to  court.7 
Obviously,  the  judge  and/or  jury  should  consider  the  non-genetic  evidence  in 
each  case  and  subjectively  establish  the  a  priori  value  P. 

VI.  STATISTICAL  RISKS.  The  putative  father  may  be  a  random  man  whose 
genetic  characteristics  just  happen  to  yield  a  high  W  value.10  The  likelihood 
of  this  happening  to  a  man  who  is  not  the  father  is 

aPUF  min  =  p (finding  a  possible  father  whose  likelihood  of 
’  paternity  is  as  great  or  greater  than  Wp^p  when 

the  putative  father  and  possible  fathers  are 
random  men). 

A  calculation  of  this  minimum,  putative  father's  risk  is  illustrated  in 
Appendix  J  for  the  example  started  in  Appendix  D. 

The  putative  father  may  be  the  father  but  the  genetic  characteristics 
of  the  mother,  child,  and  father  may  just  happen  to  yield  a  low  W  value.10 
The  likelihood  of  this  happening  is 

=  P(the  mother  and  putative  father  having  a  child  which 
’  results  in  the  putative  father's  likelihood  of 

paternity  being  as  low  or  lower  than  WpLJF  when  the 
putative  father  is  the  father). 

A  calculation  of  this  minimum  mother's  risk  is  illustrated  in  Appendix  K  for 
the  exajiple  started  in  Appendix  D.  This  risk  becomes  the  state's  risk  if  the 
state  is  to  pay  child  support  when  neither  the  mother  or  the  putative  father 
provide  it. 

VII.  HYPOTHESIS  TESTING.  A  comprehensive  and  combined  use  of  the 
probabilities  in  a  paternity  dispute  may  be  made  with  a  hypothesis  test.10 
The  null  and  alternate  hypothesis  with  the  associated  risks  are: 
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Hq:  The  putative  father  is  not  the  father  a 

The  putative  father  is  the  father  £ 

where 

a  =  P(rejecting  the  null  hypothesis,  that  the  putative  father 
is  not  the  father,  when  he  really  is  not  the  father)  and 

&  =  P (accepting  the  null  hypothesis,  that  the  putative  father 
is  not  the  father,  when  he  really  is  the  father). 

This  hypothesis  test  is  structured  according  to  the  traditional  assumption  of 
the  United  States  legal  system.  Innocence  (i.e.,  non-paternity)  is  assumed  as 
long  as  there  is  reasonable  doubt  of  guilt  (i.e.,  paternity).  This  hypothesis 
test  may  be  performed  by  using  the  analysis  table  of  Appendix  L;  this  completes 
the  numerical  example  of  Appendices  D,  F,  G,  I,  J,  K,  and  L  which  uses  the 
hypothetical  genetic  systems  of  Appendices  A,  B,  and  C  whose  discrimination 
in  an  average  paternity  dispute  is  calculated  in  Appendix  E. 

A  court's  use  of  the  analysis  table  certainly  does  not  relieve  the 
court  of  its  judgment.  In  fact,  the  court  must  make  knowledgeable  and 
judicious  determinations  to  use  the  procedure.  This  seven-step  procedure  is 
described  in  Appendix  M.  The  result  of  the  procedure  may  very  well  be  that  a 
decision  cannot  be  made  without  either  obtaining  more  genetic  information  or 
lowering  the  risks  that  the  court  is  willing  to  take.  This  need  for  more 
definitive  tests  may  occur  for  the  reasons  listed  in  Appendix  N.  The 
hypothetical  example  calculated  in  this  paper,  which  culminated  in  Appendix  L, 
is  not  very  definitive.  The  results  of  a  more  definitive  example  is  shown  xn 
Appendix  0.  Appendix  P  shows  possible  uses  of  this  analysis  table. 

The  number  of  actual  cases  that  have  been  analyzed  by  the  technique 
proposed  in  this  paper  is  quite  small.  For  these  cases.  Appendix  Q  presents 
(1)  the  line  of  the  analysis  table  which  has  the  putative  father's  posterior 
probabilities  of  paternity  and  (2)  the  entry  in  the  a  column  which  is  either  on 
or  just  above  the  putative  father's  line.  The  values  were  calculated  from  six 
red  cell  antigen  systems  recommended  by  the  Texas  Society  of  Pathologists;  these 
are  ABO,  MNSs ,  Rh,  Kell,  Duffy,  and  Kidd  systems.  These  analyses  were  done 
before  the  availability  of  HLA  testing  so  they  are  not  as  definitive  as 
currently  possible.  Although  a  lack  of  small  a  and  &  risks  may  disturb  the 
analyst,  the  results  in  Appendix  Q  aTe  useful  when  the  non-genetic  evidence 
indicates  a  high  value  of  P;  for  each  case,  a  hypothesis  test  may  be  used  in 
ruling  against  the  putative  father.  The  fact  that  a  test  has  a  high  &  risk 
has  little  meaning  when  the  ruling  favors  the  mother  anyway;  the  important 
risk  is  that  of  the  party  who  is  not  favored  by  the  ruling. 

Till.  DISCUSSION.  The  basic  philosophy  behind  the  use  of  probabilities 
in  paternity  disputes  is  that  all  available  information  should  be  used  in 
paternity  judgments.  Unless  probabilities  are  used,  genetic  tests  add  nothing 
to  the  decision  making  process  when  the  putative  father  is  not  excluded. 
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Although  probabilities  can  never  result  in  certainty  of  the  necessary  decision, 
they  can  certainly  add  information  when  no  exclusion  is  obtained.  Even  if  an 
apparent  exclusion  is  obtained  in  one  genetic  system,  probabilities  based  on 
other  systems  can  be  used  to  get  an  indication  of  the  possibility  of  a  false 
exclusion. 1 

Since  genetic  tests  are  common  and  even  required  in  many  states  before 
a  paternity  dispute  can  be  brought  to  court,  all  possible  information  from 
the  tests  should  logically  be  available  for  the  court.  The  difficulty  of 
presenting  probablistic  information  is  that  many  different  probabilities  can 
be  discussed  and  confusion  easily  results.  This  is  the  reason  for  organizing 
the  results  in  an  analysis  table  as  shown  in  Appendices  L  and  0.  The  court 
should  have  an  unbiased  presentation  of  probabilities;  the  analysis  table 
provides  such  a  presentation. 

Judges  and/or  juries  should  be  hesitant  to  use  partial  probablistic 
arguments  which  can  easily  be  given  biased  interpretations  by  lawyers  in  an 
antagonistic  legal  system.  Since  no  information  is  normally  provided  about 
the  putative  father's  and  mother's  risks  denoted  in  this  paper  by  a  and  8, 
the  courts  are  justifiably  hesitant  to  apply  much  weight  to  the  probabilities 
that  are  presented.  This  hesitancy  is  especially  justified  when  the  probablistic 
argument  does  not  let  the  court  choose  the  a  priori  probability  denoted  by  P 
in  this  paper.  A  useful  analysis  must  include  all  genetic  and  non-genetic 
information.  It  also  must  describe  all  risks  involved  without  unduly 
enphasizing  any  one  of  them. 

There  is  occasionally  a  hesitancy  to  use  a  decision  making  process  that 
admits  any  uncertainty  La  the  final  decision.  This  attitude  can  never  be 
conpletely  eliminated;  but  the  person  performing  and/or  presenting  a  probability 
analysis  must  stress  that  certainties  are  very  rare  and  analytical  consideration 
of  uncertainties  is  essential  to  intelligent  decision  making. 

There  is  occasionally  a  reluctance  to  perform  a  long  tedious  calculation. 
This  attitude  can  influence  the  preparation  of  the  procedure  proposed  in  this 
paper  be.ause  the  mutter  of  calculations  expands  rapidly  when  the  lists  of 
possible  fathers  and  possible  children  lengthens  as  more  genetic  systems  are 
added  to  the  calculation.  There  is  an  obvious  and  successful  method  of  avoiding 
the  tedium  and  possibility  of  numerical  mistakes;  the  whole  procedure  may  be 
conputerized.  To  write  the  necessary'  program,  it  has  naturally  been  necessary 
to  use  both  logical  and  arithmetic  programing  techniques. 

There  is  a  hesitancy  of  some  analysts  to  use  any  hypothesis  test  which 
does  not  have  very  low  values  for  both  the  a  and  8  risks.  The  analyst’s  job 
in  paternity  disputes  is  not  to  set  risks.  The  analyst  must  explain  the 
meaning  of  the  risks  to  the  judge  and/or  jury;  the  court  must  then  assign  the 
numerical  values  to  all  parameters  that  affect  the  use  of  the  analysis  table. 

The  court  may  very  well  wish  to  assign  different  limits  on  the  risks  of  the 
putative  father  and  the  mother  or  state.  This  decision  involves  the  relative 
cost  of  making  a  mistake;  and  only  the  holders  of  ultimate  responsibility 
should  set  risks  of  making  mistakes. 
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There  is  a  reasonable  hesitancy  of  analysts  to  use  imprecise  data.  This 
leads  to  two  concerns  in  the  calculation  of  paternity  probabilities.  First, 
the  combination  of  the  results  from  different  genetic  systems  requires  the 
different  systems  to  be  independent.  If  this  is  not  true,  a  combined  system 
such  as  illustrated  in  Appendix  H  must  be  used.  If  the  gene  frequencies  for 
this  single  system  are  not  available,  the  only  prudent  courses  of  action  are 
(1)  ignore  both  systems  or  (2)  present  two  analyses  with  each  considering  one 
of  the  two  non-independent  systems.  It  is  extremely  dangerous  to  use  the  system 
which  the  analyst  consideres  to  yield  the  most  informative  results;  this  is  done 
in  some  European  analyses1  but  it  amounts  to  selecting  data  which  will  yield  a 
result  of  predetermined  bias.  Second,  the  gene  frequencies  of  the  genetic 
systems  must  be  known  for  all  involved  populations.  Although  the  medical 
profession  must  collect  the  data  for  these  populations,  there  is  no  real  medical 
application  for  gene  frequencies.  This  is  especially  true  for  a  combined  system 
made  from  two  or  more  non-independent  systems.  When  the  analyst  is  not  given 
accurate  population  gene  frequencies ,  he  must  repeat  the  calculation  for  several 
sets  of  gene  frequencies  which  surround  the  true  population  frequencies. 

Selecting  the  frequencies  to  use,  interpreting  the  results  for  each  system, 
and  combining  the  results  of  the  different  systems  requires  more  analytical 
judgment  then  merely  inputting  data  to  a  computer  and  reading  the  final  result. 
This  is  the  real  job  of  the  analyst. 

Perhaps  the  most  important  prerequisite  for  widespread  use  of  probabilities 
in  paternity  cases  is  a  standardized  format.  There  are  many  probabilities 
involved  and  many  people  need  to  use  them.  Confusion  will  be  both  possible  and 
likely  until  some  standardized  presentation  is  accepted.  This  needed  standardi¬ 
zation  is  one  inportant  reason  for  the  use  of  the  analysis  table  shown  in 
Appendices  L  and  0.  One  method  of  obtaining  standardization  would  be  for  state 
legislatures  to  prescribe  a  format  for  information  to  be  presented  in  court. 

If  the  state  legislature  did  this,  they  might  very  well  want  to  provide  guidance 
in  the  numerical  values  to  be  used  for  the  limiting  risks  <*L  and  8L  and  the 

decision  level  W'^  that  are  required  in  the  procedure  of  Appendix  M.  This 

guidance  would,  of  course,  have  to  provide  a  list  of  feasible  genetic  systems 
and  a  procedure  to  be  followed  if  this  list  were  exhausted  without  yielding 
sufficiently  low  values  of  test  risks  ciy  and  By  to  result  in  a  decision. 

Finally,  state  legislatures  might  very  well  want  to  direct  state  organizations 
such  as  medical  schools  to  compile  gene  frequencies  for  genetic  systems  and 
combinations  of  systems  that  are  not  well  known  to  be  independent. 


IX.  PROSPECTS.  The  use  of  probabilities  in  paternity  disputes  is  very 
fragmentary  in  the  United  States.  It  is  much  more  prevalent  in  European  courts. 
Differences  in  the  European  and  United  States  legal  systems  have  prevented  the 
spread  of  the  technique  to  the  United  States. 


The  use  of  probabilities  will  undoubtedly  grow  in  the  United  States.  One 
reason  is  that  scientific  methods  and  statistical  analyses  are  continuously 
becoming  more  understood.  Another  is  that  future  availability  of  the  putative 
father's  and  mother's  risks,  which  are  not  used  in  European  courts,  should 


satisfy  objections  that  an  inconplete  and  even  erroneous  description  can  be 
obtained  by  focusing  all  of  the  court's  attention  on  a  single  and  non- 
comp  rehen  si  ve  probability. 
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APPENDIX  A 


FIRST  HYPOTHETICAL  GENETIC  SYSTEM  FOR  RANDOM  MAN 


PHENOTYPE 
FREQUENCIES  [P(I)] 

P(Q)  *  .16 
P(R)  -  .33 
P(S)  =  .24 
P(T)  *  .09 
P(U)  =  .12 
P(V)  =  .06 

RELATIONS  USED  TO  FIND  GENE  FREQUENCIES  [PCD]: 

P(Q)  *  [P(a)]2 

P(R)  «  [P(b)]2  *  2P(a)P(b) 

P(S)  =  [P(c)]2  ♦  2P(c)P(d)  +  2P(a)P(c) 

P(T)  *  (P(d)]2  *  2P(a)P(d) 

P(U)  =  2PCb)P(c) 

POO  =  2P(b)P(d) 


PHENOTYPES  [I] 

Q 

R 

S 

T 

U 

V 


POSSIBLE 
GENOTYPES  [ij] 

aa 

bb.  or  ab 
cc,  cd,  or  ac 
dd  or  ad 
be 
bd 


LEAST  SQUARES  SOLUTION  FOR  P(i)’s  MINIMIZES  62: 

62  =  (P(Q)  -  [P(a)]2}2  +  (P(R)  -  [P(b)]2  -  2P(a)P(b)}2 
+  [P(S]  -  [PCD]2  -  2P(c)P(d)  -2P(a)P(c)}2 
+  CP(T)  -  [P(d)]z  -  2P(a)P(d)}2  +  (P(U)  -  2P(b)P(c)}2 
♦  (P(V)  -  2P(b)P(d)}2 


GENE  FREQUENCIES: 


i 

a 

b 

c 

d 

PCD 

.4 

.3 

.2 

.1 

GENOTYPE  FREQUENCIES  [P(ij)J; 

a 

b 

c 

d 

a 

.16 

b 

.24 

.09 

c 

.16 

.12 

.04 

d 

.08 

.06 

.04 

.01 
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APPENDIX  B 


FIRST  HYPOTHETICAL  GENETIC  SYSTEM  FOR  ADULTS  IN  THE  CASE 


MOTHER 


GENE  FREQUENCIES: 

i _ i 

'P(i)  ~ 

GENOTYPE  FREQUENCIES: 


a 

b 

c 

d 

a  .1764 

b  .2604 

.0961 

c  .1596 

.1178 

.0361 

d  .0672 

.0496 

.0304 

.0064 

PHENOTYPE  FREQUENCIES: 


I 

Q 

R 

S 

T 

U 

V 

1 

P(I) 

.1764 

.3565 

.2261 

.0736 

.1178 

.0496 

PUTATIVE  FATHER 

GENE  FREQUENCIES: 


i 

a 

b 

c 

d 

P(i) 

.38 

.29 

.21 

.12 

GENOTYPE  FREQUENCIES: 


a 

b 

c 

d 

a  .1444 

b  .2204 

.0841 

c  .1596 

.1218 

.0441 

d  .0912 

.0696 

.0504 

.0144 

PHENOTYPE  FREQUENCIES: 


I 

Q 

R 

S 

T 

U 

V 

P(I) 

.1444 

.3045 

.2541 

.1056 

.1218 

.0696 

475 


APPENDIX  D 


PROBABILITIES  OF  EXCLUSION  IS  A  SPECIFIC  CASE 


DEFINITIONS 


M 

C 

POF 

RM 

G 


2 


P 


ex 


P 


ex 


Mother 

Child 

Possible  Father 
Transmits 
Random  Man 
Gene  POF  Must  -*• 

P(RM  could  -  G) 

P(RM  tests  as  if  he  could  ■+ 
P(RM  could  not  -*■  G) 

P(RM  is  Excluded  by  Tests) 


G) 


SYSTEM  1 

M  =  S  *  cc,  dc,  or  ac 
C  =  T  -  dd  or  ad 
M  *  cd  or  ac 
POF  -*■  a  or  d 

POF  =  aa,  ab,  ac,  ad,  bd,  cd,  or  dd 
POF  =  Q,  R,  S,  T,  or  V 
V  =  P(RM  =  aa,  ab,  ac,  ad,  bd,  cd,  or  dd) 
=  .16  +  .24  +  .16  +  .08  +  .06  +  .04  +  . 


SYSTEM  2 

M  =  JJ 
C  =  JK 


POF  K 

POF  *  JK  or  KK 


01 


Z'  =  P(RM  =  JK  or  KJC) 
■=  .42  +  .49 


=  .75 

Z  =  P(RM  *  Q,  R,  S,  T,  or  V) 

=  .16  +  .33  +  .24  ♦  .09  +  .06 


=  .91 
Z  ■»  V 
=  .91 


=  .88 


COMBINATION 

V  =  Z'  Z'  =  (.73)  (.91)  =  .6825 
Z  *  Z^  =  (.88) (.91)  »  .8008 


P'  =  1  - 
ex 

V  “  1  - 

. 682S  * 

.3175 

P  =  1  - 
ex 

Z  =  1  - 

.8008  = 

.1992 
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appendix  e 

AVERAGE  PROBABILITY  OF  EXCLUSIONS 


SYSTEM  2 


M 

POO 

C 

P(CiM) 

POF 

2 

JJ 

.16 

JJ 

(1)( -3) 

JJ,  JK 

.09  ♦  .42 

JJ 

.16 

JK 

(DC-7) 

JK,  KK 

.42  ♦  .49 

! 

JK 

.48 

JJ 

(1/2) (.3) 

JJ,  JK 

.09  ♦  .42 

tv 

.48 

JK 

(1/2) (.3) 

JJ,JK,KK  ! 

.09  *  .42  .49 

♦  (1/2) C- 7) 

JK 

.48 

KK 

(1/2)  (.7) 

JK,  KK 

.42  ♦  .49 

KK 

.36 

JK 

CD  (-3) 

JJ,  JK 

.09  +  .42 

KK 

.36 

KK 

(D(.7) 

JK,  KK 

.42  ♦  .49 

POC  IN 

StSPi  2  P.(M)  Pi(C|M)  2.  *  .8404 

L*  1  1  1 


SYSTEM  1 


Q 

Q 

Q 

Q 


R 

R 


R 

R 

R 


.1764 

.1764 

.1764 

.1764 


.3565 

.3565 


.3565 

.3565 

.3565 


.3565 


Q 

R 

S 

T 


Q 

R 


S 

T 

U 


aa 

aa 

aa 

aa 


ab 

bb,ab 


ab 

ab 

bb  ,ab 


bb.ab 


aa 

ab 

ac 

ad 


aa 

bb,ab 


ac 

ad 

be 


bd 


Cl)  (.4) 
Cl) (-3) 
(l)(-2) 
(D(.D 


(.2604/. 3565) (1/2) (.4) 
(.0961/. 3565) (1) (.3+. 4) 

+  ( . 2604/ . 3565) [ (1/2) ( . 3) 

♦(1/2)  (.>.4) 
(.2604/. 3565) (1/2)  (.2) 

(. 2604/. 3565) (1/2) (.1) 
(.0961/. 3565) (1)  (.2) 

♦  (.2604/. 356 5) (1/2) (.2) 
(.0961/. 3565) (l)(.l) 


— 1  — i  .H  C/l  C/lC/J  C/3U1CO 


.2261 

.2261 

.2261 


.1178 

.1178 

.1178 

.1178 

.0496 

.0496 

.0496 

.0496 

.0496 


ac 

ac 

cc,cd, 

ac 


cd,ac 

cc,cd, 


ad 

ad 

dd,ad 


dd,ad 


dd.ad 


aa 

ab 

cc.dc 

ac 


dd.ad 


aa 

ab 

cd.ac 


dd.ad 


bb.ab 
cc, cd.ac 
be 
bd 

bb  ,ab 
cd 

dd,ad 

be 

bd 


P  (C 1 M)  _ _ _ 

(.159S/.2261)(l/2)(.4) 

(.1596/. 2261) (1/2) (.3) 

(.0361/. 2261) (1) (.2+.1+.4) 

♦  (.0304/. 2261) [(1/2) (.2+.1+.4) 

*  (1/2)  (.2)  ] 

+  (. 1S96/.2261)  1(1/2)  (.2) 

♦  (1/2) (.2+.1+.4)] 

(. 0304/. 2261) (1/2) (.1+. 4) 

+  (. 1596/. 2261) (1/2) (.1) 

(.0361/ .2261) (1) (. 3) 

+  (.0304/ .2261) (1/2) (. 3) 

♦  (.1596/. 2261) (1/2) (.3) 

(.0304/ . 2261) (1/2) (.3) 

(.0672/. 0736) (1/2) (.4) 

(.0672/ .0736) (1/2) (. 3) 

( .0064/ .0736) (1) (.2) 

♦  (.0672/. 0736) [(1/2) (.2) 

♦  (1/2)  (-2)) 
(.0064/. 0736) (1) (. 1+.4) 

«•  (.0672/. 0736) { (1/2)  (. 1) 

♦  (1/2) (.l+.4)] 

( .0064/ .0736) (1)  ( .  3) 

♦  (.0672/. 0736) (1/2) (.3) 

(1/2) ( • 3+ . 4) 

(1/2) (.2+.1+.4) 

(1/2)  ( .2)+  (1/2)  (.  3) 

(1/2) (.1) 

(1/2) (. 3+ .4) 

(1/2) ( . 2) 

(1/2) (.1+ .4) 

(1/2) (.2) 

(1/2) C - 1) ♦ (1/2) ( -  3) 
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GENES  THE  FATHER  MUST  PROVIDE 
AS  INDICATED  BY  .4,  .3,  .2, 

AND  .1  FOR  a,  b,  c,  AND  d, 
RESPECTIVELY 

Z' 

Z 

.4 

. 16+ . 24+ . 16+ . 08“ . 64 

.16+.33+.24+.09=.82 

.3 

.24+. 09*. 12+. 06*. 51 

.33+.12+.06*.51 

.2 

. 16+ . 1 2+ . 04+ . 04* .  36 

.24+.12-.36 

.1 

.08+.06+.04+.01=.19 

.24+.09+.06-.39 

.4  and  .3 

.16+.24+.16+.08 

+.09+.12+.06=.91 

.16+. 33+.24+.09 

+.12+. 06=1.0 

. 4  and  . 1 

. 16+.24+.16+.08 
+.06+.Q4+.0l=. 75 

.16+.33+.24+.09 

+.06=. 88 

. 3  and  . 2 

•24+.09+.12+.06 

+.16+.04+.04-.7S 

.33+.12+.06 

+.24=. 75 

. 3  and  . 1 

.24+.09+.12+.06 

+.08+.04+.01*.64 

.33+.12+.06 

+.24+.09-.84 

.4,  ,2,  and  .1 

.16+.24+.16+.08 

+.12+.04+.04 

+.06+.01=.91 

.16+.33+.24+.09 
+  .12 

+.06=1.0 

POC  IN 
SYSTEM  1 

Zl\ave  =  I  Pi^  Pi(ClNI)  Z'i  =  -6771 
i 


POC  IN 
SYSTEM  1 

Zl>ave  =  l  PiM  VCIM)  Zi  "  -7646 


WWINATIQN 

Z'  =  l{  V2  =  (.8404)  (.67713  =  .569 
Z  *  Zj  Z2  =  (.8404) (.7646)  =  .643 
V'  =  1  -  1'  =  1  -  .569  =  .431 


p  =i-7  =  i-  .fiax  = 


APPENDIX  F 


CONDITIONAL  PROBABILITIES  BASED  ON  EXISTENCE  OF  PHENOTYPES 


ADDITIONAL  DEFINITIONS 


F 

Father 

PUF 

Putative  Father 

Y' 

P(PUF's  Phenotype |M's  and  C's  Phenotypes  and  PUF^F) 

X' 

P(PUF's  Phenotype |M's  and  C's  Phenotypes  and  PUF=F) 

SYSTEM  1 

SYSTEM  2 

M  S  =  cc,  cd,  or  ac 

M  =  JJ 

C  T  =  dd  or  ad 

C  =  JK 

K  cd  or  ac 

P.  -  a  or  d 

POF  -  K 

POF  =  Q,  R,  S,  T,  or  V 

POF  *  JK  or  KK 

PUF  =  S 

PUF  =  JK 

Y'  =  P(PUF  =  S|PUF  =  RM) 

=  .24 

Y'  =  P(PUF  =  JK j PUF  *  RM) 

=  .42 

X'  *  P(PUF  =  Sj PUF  =  Q,  R,  S,  T,  or  V) 

*  .  2541/ ( . 1444+. 3045+ . 2541+ . 1056+ . 0696) 

*  .2893 

X'  =  P(PUF  =  JK|  PUF  =  JK  or  KK) 
*  .32/ (.32+. 64) 

=  .3333 

Y'/X'  =  .2400/. 2893 
*  .8296 

Y'/X'  =  .4200/. 3333 
=  1.260 

COMBINATION 

Y'  =  Y'  Y'  =  (.2400) (.4200)  *  .1008 
X'  *  X'  X'  =  (.2893) (.3333)  =  .09642 
Y'/X'  =  .1008/. 0964  *  1.05 
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SYSTEM  1  (Continued) 

SYSTEM  2  (Continued) 

X  =  P(M=cd)P(cd+d)[P(PUM)+(PlJF*a)] 

X  =  P(JJ-J)P(PUP+K) 

+P  (M=ac)P (ac-a) ♦  (PUF-d) 

*  1(1/2) 

=  (. 1600) (1/2) (.0992+. 3140) 

=  .500 

+(.8400) (1/2) (.0992) 

=  .0747 

Y/X  =  .0820/. 0747  =1.10 

Y/X  =  .7/. 5  *  1.40 

COMBINATION' 

Y  =  Yj  Y2  =  (.0820) (.7)  =  .0574 
X  ■  Xj  X2  =  (.0747) (.5)  =  .0374 


Y/X  =  .0574/. 0374  =1.53 


APPENDIX  H 


TABLE  OF  COMBINED  SYSTEM  TO  REPLACE 
THE  HYPOTHETICAL  SYSTEMS  1  AND  2  WHEN  THEY  ARE  NOT  INDEPENDENT 


Q 

R 

s 

T 

u 

V 

JJ 

PHn 

PHl2 

PHl3 

W14 

*15 

W16 

JK 

ph2i 

ph22 

ph23 

pn24 

ph25 

^26 

KK 

ph3i 

ph32 

ph33 

ph34 

ph3S 

W36 

The  lumber  of  phenotypes  in  a  combined  system  is  the  product  of  the  nunber  of 
phenotypes  in  the  individual  systems. 


APPENDIX  I 

APPLICATION  OF  BAYES  THEOREM 


ADDITIONAL  DEFINITIONS 


W 

P 


p (PDF  -  C>any  ^|G  was  -) 


P(PUF*F  on  the  basis  of  all  non-genetic  information) 


A 


APPENDIX  J 


XPUF 

X  of  the  PUF 

P1 

X  of  a  POF  of  I'th  Phenotype  from  RM  Population 

w 

WPUF 

W  of  the  PUF 

WPOF 

W  of  a  POF 

PUF, min 

pCWpOF  i  WpuF|PUF  =  RM) 

SYSTEM  1 

1  SYSTEM  2 

M  -  S  =  cc,  cd,  or  ac 
C  =  T  •>  dd  or  ad 
M  3  cd  or  ac 

P(M*cd)  *  .16;  P (M=ac)  *  .84 
Y  =  .0820 

Y/Xpjp  =  1-10 

POF  ■»  a  or  d 

POF  *  Q,  R,  S,  T,  or  V 


M  *  JJ 
C  *  JK 


(from 

Appendix  G)  Y  =*  P(M*J)  P(RM-K)  -  (1)(.7) 

Y/Xpup  '  1.4  (from  Appendix  G) 

POF  +  K 
POF  •  JK  or  KK 


X^*1  =■  P(M=cd)P(cc^-d)lP(RMI^d)-*-P(RMI-*a)]  XP0Fl1  -  P(M-J)P(RMj-<-K) 


+  P(M=ac)P(ac+a)P(RMj-*d) 

=  (.16)  (1/2)  (0+1)+ (.84)  (1/2)  (0) 
>  .0800 

^  *  (-16) (1/2) [0+(.24/.33) (1/2)| 

♦  (.84) (1/2) (0) 

-  .0291 

XP°F=S  =  (_ 16) (1/2) [ (.04/ .24) (1/2) 


^OF-JK 

xP0F»KK 


(1) (1/2) 

.500 

(1)(1) 

1.00 


XP0F,1 


+(.16/ .24) (1/2)] 

+  (.84) (1/2) (.04/. 24) (1/2)  3  .0683 

(•16)  (1/2)  [  (.01/ .09)  (1) 

+(.08/. 09) (1/2) 

+(.08/. 09) (1/2)) 

+  (• 84) (1/2) ( (.01/ .09)  (1) 

+(.08/ .09) (1/2) ] 

.315  486 


Si' STEM  1  (Continued) 

SYSTEM  2  (Continued) 

XTOF=V  =  (.16)  (1/2)  (.5+0)  +  (.84)  (1/2)  (.5) 

=  .250 

Y/X^Q  =  .082/. 0800  =  1.02 

Y/XFOpaJK  =  .7/.500  =  1.40 

Y/XP0F=R  =  .082/. 0291  =  2.82 

y/xP0F=KK  =  -7/1<00  s  .700 

Y/XP0FsS  =  .082/. 0683  =1.20 

Y/XP0FbT  =  082/. 313  .  .262 

Y/XP0F=V  =  .082/. 250  =  .328 

COMBINATION 

Y/XPUF  =  ^l^l.PUF-1  ^V^PU^  =  1,53 


POP 

p(pof)  =  pappap 

(Y/X)  =  (Y/X)10’/X)2 

IS  (Y/X)  <  OVXpjjp) 

Q  and  JK 

(.16)  (.42)  =  .0672 

(1.02) (1.40)  =  1.43 

YES 

Q  and  KK 

(.16)  (.49)  *  .0784 

(1.02) (.700)  *  .714 

YES 

R  and  JK 

(.33) (.42)  =  .1386 

(2.82) (1.40)  =  3.95 

NO 

R  and  KK 

(.33) (.49)  =  .1617 

(2.82) (.700)  =1.97 

NO 

S  and  JK 

(.24) (.42)  =  .1008 

(1.20) (1.40)  »  1.68 

NO 

S  and  KK 

(.24) (.49)  =  .1176 

(1.20) (.700)  =  .840 

YES 

T  and  JK 

(.09) (.42)  =  .0378 

(.262) (1.40)  =  .367 

YES 

T  and  KK 

(.09) (.49)  »  .0441 

(.262M.700)  =  .183 

YES 

V  and  JK 

(.06) (.42)  =  .0252 

(.328) (1.40)  =  .459 

YES 

V  and  KK 

(.06)  (.49)  =  .0294 

2  *  1^2  =  .8008 

(, 328) (.700)  =  .230 

YES 

aPUF,min 


all  i  for  which 

(Y/XjJitV/Xpup) 


i 


P.(POF) 


.3997 


* 
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APPENDIX  K 


MINIMUM  MOTHER’S  RISK 


ADDITIONAL  DEFINITIONS 


POC 

Possible  Child  of  M  and  PUF 

l 

YOI 

Y  foT  a  Quid  of  Phenotype  I 

« 

X  for  a  Child  of  Phenotype  I  when  the  PUF  has  Phenotype  J 

P(POC=I) 

Probability  that  M  and  PUF  will  have  Child  of  Phenotype  I 

WPOC 

W  for  PUF  if  PUF  and  M  had  POC 

P<Wroci'WPUF  ■  F) 

SYSTEM  1 

SYSTEM  2 

P  =  S  *  cc,  cd,  or  ac 

M  -  JJ 

C  *  T  *  dd  or  ad 

C  *  JK 

M  =  cd  or  ac 

Unless  M  has  C/T,  P(M=ac)  ■  .84*1  from 

P(M=cd)  =  .16J  Appendix  G 

PUF  *  S  ■  cc,  cd,  or  ac 

PUF  =  JK 

POC  -  aa,  ac,  ad,  cc,  cd,  or  dd 

POC  -  JJ  or  JK 

POC  «  Q,  S,or  T 

If  POC  *  Q,  M  =  ac 

P(M=ac)  =  1 

If  POC  *  S,  M  *  cd  or  ac 

P(M®cd)  ■  .16;  P(M»ac)  =  .84 

If  POC  a  T,  M  *  cd  or  ac 

P(M*cd)  =  .16;  P(M*ac)  ®  .84 

P(PUF-*-a)  -  (.1596/. 2541)  (1/2)  -  .3140 

P(PUF*J)  -  1/2 

P(PUP*c)  *  (. 1596/. 2541*. 0504/. 2541)  (1/2) 

♦  (. 0441/. 2541)  (1)  »  .5868 

P(PUP+K)  -  1/2 

P(PUF-d)  -  (.0504/. 25411(1/2)  *  .0992 


SYSTEM  1  (Continued) 

SYSTEM  2  (Continued) 

Yr  n  =  P(M=ac)P(ac-*-a)P(RM+a) 

=  (1) (1/2) (.4)  =  .200 

Yr  ..  =  P(JJ-*\J)P(RM*.J) 

L  =  (1) (. 3)  =  .300 

*  P (M=ac)P  (ac-*-a) P (PUF*a) 

=  (1) (1/2) (.3140)  =  .157 

XPUF*JK  a  p(jj-j)p(puji^j) 

=  (1)  (1/2)  -  .500 

yc«q/xc=Q=S  =  *200/,1S7  =  ll27 

yc=jj'/xc=jj  K  =  =  ,60° 

P(POC-Q)  -  P(M=ac)P(aoa)P(PUF*a)  ' 

-  (.84) (1/2) (.314) 

=  .132 

P(POC=JJ)  =  x^JK 
=  .500 

Yr  ~  *  P(M=ac)  (P(ac*-a)P(RM*-c) 

♦P(ac-K:)P(RM-»a,c,  or  d)] 

+  P  (M=cd)  [  P  (cd-*d)  P  (RNP*c) 

+P(cd-K:)P(RM-*a>c,  or  d)] 

=  (.84)((l/2)(.2)+ (1/2) (. 4+. 2+. 1) ] 

♦  ( • 16) [  (1/2)  (.2)  +  (l/2) (.4+.2+ . 1) ] 

=  .450 

YC=  JY  =  P(J>J)P(^K) 

^  «  (1) ( . 7)  =  .700 

xJ^=S  =  P(M=ac)  [P(ac-*a)P(PUF-*c) 

+P(ac'*’C)P(PUF^a,c .  or  d)] 

♦  P(M=cd)  [P(cd-*d)P(PUF-*c) 

+P(cd-*c)P(PUF-*a,c,  or  d)] 

*  (• 34) [ (1/2) (. 5868) +  (1/2)  (1) ] 

*  (■ 16) [ (1/2)  (. 5868)  +  (1/2) (1) ] 

=  .793 

X^K  *  P  (JJ-^J)P  (PUF-*-K) 

»  (1)  (1/2)  «  .500 

YC=S/XC=S=S  *  -450/. 793  =  .567 

yc=jk/xc=jk  =  -7/,s  “  1,40 

P(POC=S)  =  X^T2  =  .793 

P(P0C=JX)  =  X™j=JK  =  .500 

Yr_T  »  P  (M=ac)  P  (ac->a)  P  (RM-»d) 

+  P(M=cd)P(cd-*d)  (P(RM->d)+P(RM-»a)] 

=  (.84) (1/2) (. 1)  *  (.16) (1/2) (.1+.4) 

=  .0820 

*  P(M«ac)P(ac-a)P(PUF-d) 

C  ♦  P(M=cd)P(cd-*d)  (P(PUF-*d)+P(PUF-*^)  ] 

=  (.84) (1/2) (.0992) 

♦  (. 16) (1/2) (.0992+. 3140) 

-  .0747 

WX^S  =  .0820/.  074  7  =  1.10 

P(POOT)  -  X^=S  =  .075 


["1 


COMBINATION 


Y/Xpup  “1.53  from  Appendix  G  or  J  or  K 


POC 

P(POC)  =  P(POC1)P(POC2) 

(Y/X)  -  (Y1/X1) (Y2/X2) 

IS  (Y/X)  >  1 

Q  and  JJ 

(.132) (.500)  =  .066 

(1.27) (.600)  »  .762 

NO 

Q  and  JK 

(.132) (.500)  =  .066 

(1.27) (1.40)  ■  1.78 

YES 

S  and  JJ 

(.793) (.500)  =  .397 

(.567) (.600)  -  .340 

NO 

S  and  JK 

(.793) (.500)  -  .397 

(.567) (1.40)  *  .794 

NO 

T  and  JJ 

(.075) (.500)  =  .037 

(1.10) (.600)  *  .660 

NO 

T  and  JK 

(.075) (.500)  =  .037 

(1.10) (1.40)  =1.53 

YES 

P(all  POC)  *  1.000 


all  i  for  which 
(Y/X).>(Y/XD,rc) 


ii; 
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APPENDIX  L 

ADDITIONAL  DEFINITIONS 

ANALYSIS  TABLE 

Putative  Father's  Risk 
Mother's  Risk 

Likelihood  of  Paternity  for  a  priori  value  P 


INPUT  INFORMATION 


ORDERED 
Y/X  VALUES 

FROM  a  INPUT  3  INPUT 

APPENDICES  FROM  FROM 

J  AND  K  APPENDIX  J  APPENDIX  K 


ANAL  Y  SIS  TABLE 


.8008 


0.0000 

1.000 

0.0441 

.378 

.845 

.980 

0.073S 

.326 

.813 

.975 

1.000 

.246 

.746 

.964 

0.1113 

.232 

.732 

.961 

0.1365 

.195 

.685 

.951 

0.603 

.144 

.602 

.932 

0.2149 

.135 

.583 

.926 

0.566 

.127 

.568 

.922 

0.500 

.123 

.557 

.919 

0.3325 

.117 

.543 

.915 

0.3997 

.072 

.412 

.863 

0.103 

.068 

.395 

.855 

0.5005 

.062 

.373 

.843 

0.066 

.059 

.360 

.835 

0.6622 

.053 

.337 

.820 

0.8008 

.027 

.203 

.696 

1.0000 

0.000 

The  putative  fa' 
containing  W. /7 

ther's 
=  .395 

line  is 

the  one 

APPENDIX  M 


HYPOTHESIS  TEST  PROCEDURE 

1.  Set  a  numerical  value  of  ciy  =  the  largest  allowable  risk  of  erroneously 
deciding  that  the  putative  father  is  the  father. 

2.  Set  a  numerical  value  of  By  =  the  largest  allowable  risk  of  erroneously 
deciding  that  the  putative  father  is  not  the  father. 

3.  Set  a  numerical  value  of  Wy.  =  the  lowest  probability  of  paternity  which 
implies  that  the  putative  r  ther  is  the  father. 

4.  Set  a  numerical  value  of  P  =  the  "a  priori"  probability  of  paternity 
for  the  putative  father. 

5.  From  the  Analysis  Table,  obtain  cty  *  the  risk  in  this  particular  test 
of  erroneously  deciding  that  the  putative  father  is  the  father.  This 

is  the  number  at  the  intersection  of  the  alpha  colum  and  the  row  of  the 
smallest  number  in  the  appropriate  Wp  column  which  is  not  smaller  than 
Wc;  if  this  location  is  vacant,  use  the  number  immediately  above  the 
intersection. 

6.  From  the  Analysis  Table,  obtain  By  =  the  risk  in  this  particular  test 
of  erroneously  deciding  that  the  putative  father  is  not  the  father. 

This  is  the  nunber  at  the  intersection  of  the  beta  column  and  the  row 

of  the  largest  number  of  the  appropriate  Wp  column  which  is  smaller  than 
if  this  location  is  vacant,  use  the  number  immediately  below  the 
intersection. 

7a.  If  (1)  ay  is  not  larger  than  ay,  (2)  By  is  not  larger  than  By,  and 
(3)  Wp  of  the  putative  father  is  as  large  or  larger  than  W^.;  conclude 
that  the  putative  father  is  the  father. 

7b.  If  (11  ay  is  not  larger  than  a y,  (2)  By  is  not  larger  than  Bj ,  and 
(3)  Wp  of  the  putative  father  is  smaller  than  W^;  conclude  that  the 
putative  father  is  not  the  father. 

7c.  If  (1)  ay  is  larger  than  ciy  and/or  (2)  Bj  is  larger  than  By,  recognize 
that  the  test  results  are  inconclusive  and  more  genetic  tests  are 
necessary  to  reduce  Oy  and/or  By. 
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APPENDIX  N 

REASONS  MORE  GENETIC  TESTS  MAY  BE  REQUIRED 
TO  MAKE  THE  ANALYSIS  TABLE  MORE  DEFINITIVE 

1.  The  court  may  be  overly  stringent  and  set  cij  and/or  B^  too  low. 

2.  The  court  may  set  high  and  thus  cause  B.p  to  be  high. 

3.  The  court  may  set  low  and  thus  cause  to  be  high. 

4.  The  court  may  set  P  high  and  thus  cause  to  be  high.  In  the  limit 

as  P  approaches  1.0,  approaches  1.0;  this  statistically  states  that 
a  putative  father  who  really  is  not  the  father  has  no  chance  if  the 
court  is  certain  that  he  is  the  father. 

5.  The  court  may  set  P  low  and  thus  cause  B-j-  to  be  high.  In  the  limit 

as  P  approaches  0.0,  approaches  1.0;  this  statistically  states  that 
a  putative  father  who  is  the  true  father  will  not  be  judged  to  be  the 
father  if  the  court  is  certain  that  he  is  not  the  father. 

6.  Finally,  the  genetic  tests  may  not  be  sufficiently  definitive.  Performing 
genetic  tests  using  more  marker  systems  will,  on  the  average,  tend  to 
either  (a)  lower  the  alpha  and  beta  risks  of  the  tests  and  (b)  drop  the 
probability  of  paternity  to  exactly  0.0  if  the  putative  father  is  not 

the  father  or  raise  the  probability  of  paternity  toward  1.0  if  he  is 
the  father. 
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APPENDIX  0 

EXAMPLE  OF  STANDARD  THREE  PARTY  PATERNITY  DISPUTE  ANALYSIS 
GENETIC  SYSTEMS  ANALYZED:  ABO  and  Kell 

PHENOTYPES:  Mother  e  0  and  kk,  Child  *  and  kK,  Putative  Father  *  A2B  and  kK 
RACIAL  BACKGROUNDS:  Mother  *  Mexican-American;  Putative  Father  «  Anglo 
GEOGRAPHIC  AREA:  El  Paso,  Texas 


ALPHA 

BETA 

Kl/10 

Wl/3 

"'1/2 

"2/5 

W9/10 

0.00000 

1.00 

1.00 

0.948 

0.988 

0.994 

0.997 

0.999 

0.88 

0.948 

0.988 

0.994 

0.997 

0.999 

0.00013 

0.851 

0.963 

0.981 

0.990 

0.998 

0.00068 

0.795 

0.946 

0.972 

0.986 

0.997 

0.00072 

0.786 

0.943 

0.971 

0.985 

0.997 

0.00S48 

0.741 

0.928 

0.963 

0.981 

0.996 

0.04049 

0.661 

0.898 

0.946 

0.972 

0.994 

0.040S7 

0.653 

0.894 

0.944 

0.971 

0.993 

0.04281 

0.648 

0.892 

0.943 

0.971 

0.993 

0.04312 

0.621 

0.881 

0.937 

0.967 

0.993 

0.74 

0.532 

0.836 

0.911 

0.953 

0.989 

0.04758 

0.485 

0.809 

0.894 

0.944 

0.987 

0.63 

0.482 

0.807 

0.893 

0.944 

0.987 

0.06552 

0.451 

0.787 

0.881 

0.937 

0.985 

o.so 

0.387 

0.740 

0.850 

0.919 

0.981 

0.38 

0.386 

0.739 

0.850 

0.919 

0.981 

0.24 

0.038 

0.150 

0.260 

0.413 

0.760 

0.13 

0.031 

0.126 

0.224 

0.366 

0.722 

1.00000 

0.00 

0.000 

0.000 

0.000 

0.000 

0.000 

The  putative 

Father's 

Wp  scores 

are  in  the 

row  with 

=  0.893 
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POUR  POSSIBLE  USES  OF  ANALYSIS  TABLE  OF  APPENDIX  N 


INPUT  DATA 

“L  PL  "c  P 

INFORMATION  FROM 
ANALYSIS  TABLE 

®T  h  WPUF 

RESULT 

.10  .25  .900  2/3 

.06552  .24  .944 

°T  *  aL 

WPUF  -  KC 

Therefore  PUF  *  F 

.05  .25  .900  2/3 

. 06S52  .24  .944 

op  >  aL 

Therefore  no 
decision  without 
more  information 
or  altered  risk 

.01  .25  .900  1/10 

.00000  .74  .482 

6t  >  6l 

Therefore  no 
decision  without 
more  information 
or  altered  risk 

APPENDIX  Q 

SUMMARIES  OF  RESULTS  IN  FOUR  ACTUAL  PATERNITY  DISPUTES 


CASE 

aPUF,min 

0 

M,min 

wi/io 

Wl/3 

V 

W2/3 

W9/10 

1 

.077 

Not 

Available 

.30 

.66 

.79 

.88 

.97 

2 

.018 

.49 

.41 

.76 

.86 

.93 

.98 

3 

.022 

.85 

.75 

.93 

.96 

to 

OO 

.99 

4 

.084 

.99 

.31 

.67 

.80 

.89 

.97 

For  a  hypothesis  test  to  indicate  a  ruling  against  the  putative  father: 

(1)  P,  the  a  priori  or  non- genetic  probability  of  paternity  of  the  putative 
father,  and  W^,  the  critical  or  decision  level  of  the  posterior  likelihood  of 

paternity,  must  be  set  such  that  Wp  ;  and 

(2)  cx-j,,  the  risk  of  falsely  rejecting  the  assumptksri^t '..at  the  putative  father 
is  not  the  father,  must  be  set  such  that  otj  >  otp^  m^n- 


For  a  hypothesis  test  to  indicate  a  ruling  against  the  mother: 

(1)  P  and  Wc  must  be  set  such  that  Wp  <  Wc  ;  and 

(2)  the  risk  of  falsely  accepting  the  assumption  that  the  putative  father 
is  not  the  father  must  be  set  such  that  >  S^j 
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AN  ALGORITHM  FOR  TRILATERATION 


James  T.  Hall 

Atmospheric  Sciences  Laboratory 
US  Army  Electronics  Research  and  Development  Command 
White  Sands  Missile  Range,  New  Mexico  88002 


ABSTRACT 

A  vector  algorithm  is  presented  for  determining  spatial  position  of  one 
or  more  objects  with  range-only  information  from  three  noncollinear  stations. 
An  error  analysis  shows  the  dependence  of  spatial  position  uncertainty  on  the 
geometry  of  the  measurement  array. 


INTRODUCTION 


The  operations  associated  with  the  objectives  of  a  National  Range  produce 
technically  complex  problems.  A  prime  example  is  the  requirement  for  time- 
space-position-  information  (TSPI)  which  is  acquired  by  instrumentation  radars. 
This  information  is  required  if  weapon  systems  effectiveness  is  to  be  evaluated. 
The  complexity  of  an  evaluation  is  exemplified  by  the  fact  that  these  weapons 
are  referred  to  as  "smart"  or  "dumb"  depending  largely  on  their  inherent  ability 
to  maneuver  themselves  to  a  predetermined  target  which  may  also  be  moving. 

Radars  likewise  have  moved  from  the  category  of  dumb  to  smart,  having 
learned  the  basic  laws  of  physics  and  acquired  the  ability  to  selectively 
filter  incoming  information.  This  filtering  is  based  on  prior  knowledge  of 
the  physical  constraints  of  the  target  involved. 

The  computer  also  points  and  drives  the  antenna  based  on  the  physical 
laws  of  motion  and  uses  the  normal  radar  tracking  signal  to  periodically 
verify  that  its  past  prediction  of  the  current  target  position  is  accurate. 

This  is  a  giant  step  in  the  basic  philosophy  of  radar  operation;  it  does 
nevertheless,  require  that  the  antenna  be  pointed.  This  single  property,  i.e., 
pointing  the  antenna,  produces  virtually  all  the  requirements  and  problems 
with  precision  pedestals,  calibration,  and  some  of  the  propagation  errors 
including  refractive  bending. 

This  report  is  concerned  with  an  analytical  scheme  which  uses  three 
noncollinear  stations  for  range-only  measurements  on  any  number  of  targets 
to  provide  TSPI.  There  is,  of  course,  no  pointing  antenna  required  in  such 
a  system.  The  type  of  station  to  perform  this  task  has  come  to  be  known  as 
distance  measurement  equipment  (DME)  stations.  This  is  not  a  new  idea,  but 
one  that  has  become  workable  with  the  advent  of  the  high-speed  computer.  The 
principle  and  existence  of  pointing  and  ranging  radars  may  never  be  replaced 
in  most  applications;  there  are,  however,  many  applications  which  require 
great  accuracy  in  spatial  determination  of  one  or  more  objects.  For  these 
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applications,  pointing  radars  become  very  expensive,  and  calibration 
procedures  and  checks  consume  increasingly  more  time. 

The  early  DME  analytical  methodology  and  analysis  was  done  in  the 
1960's  £1,  2,  33.  During  the  early  1970' s  IBM,  Cubic  and  General  Dynamics 
developed  systems  which  used  these  analytic  procedures  to  perform  a 
variety  of  tracking  and  position  location  problems.  Current  information 
on  these  systems  can  be  acquired  from  those  companies. 

Most  analytical  developments  have  followed  the  form  given  here  with 
the  exception  of  the  angle  dependence  given  in  equation  (21)  and  its  resulting 
error  analysis.  This  vector  solution  is  believed  to  offer  computational 
advantages  and  Insight  into  the  geometric  needs  of  the  triad. 


METHOD  I  (INTERSECTING  SPHERES) 

The  first  and  most  obvious  analytic  method  of  determining  the  spatial 
position,  given  three  ranges  from  three  known  points,  is  the  simultaneous 
solution  of  three  distance  equations.  Each  distance  equation  can  be  written 
to  define  a  sphere  of  radius  R^,  i  ■  1,  2,  3,  equal  to  the  measured  ranges 
from  three  known  points  to  the  unknown  point  (x,  y,  2).  Three  intersecting 
spheres  define  two  points  separated  in  this  case  by  the  plane  of  observation; 
therefore,  if  the  known  points  which  define  this  plane  are  contained  on  the 
earth's  surface,  only  one  point  will  be  of  real  concern,  i.e.,  that  above 
the  plane  of  observation. 

The  location  of  the  plane  of  observation  is  arbitrary  and  alternations 
of  the  above  conclusion  are  apparent.  The  only  analytic  requirement  is 
that  the  three  known  points  not  be  collinear. 

Let  the  observation  points  have  locations  (0,  0,  0).  (e,  0,  0)9  and 
(g,  h,  O)^,  where  the  subscripts  denote  station  number.  The  three  equations 
are  then: 

2  2  2  2 

-  *  +  y  +  *  (i) 

2  222 

R2  -  (x  -  e)  +  y  +  z  (2) 

R3  "  (x  -  g)2  +  (y  -  h)2  +  z2  (3) 

with  resultant  solutions  for  (x,  y,  z)  as: 

1  ,„2  2  ,  2V 

x  "  2l  (Rl  "  R2  +  e  >  (4) 

y  “  Ih  ^R2  '  R3  +  2x  "  8>  +  1,2  +  ®2  "  *2)  (5) 


498 


and 


(6) 


z 


y2)1* 


Observe  that  in  choosing  the  locations  of  the  stations,  (1)  was  made 
the  origin,  and  the  line  connecting  station  (1)  and  (2)  was  made  the  x-axis 
of  an  orthogonal  coordinate  system.  The  three  stations  also  define  the 
plane  of  z  ■  0.  This,  or  a  similar  selection  greatly  simplifies  the 
solution  while  remaining  semigeneral,  i.e.,  this  choice  can  always  be  made 
when  accompanies  by  a  proper  coordinate  transform. 

The  requirement  for  nonlinearity  of  the  three  stations  is  apparent  in 
the  solution  since  neigher  e  nor  h  can  be  zero.  There  are  two  conditions 
of  station  geometry  which  permit  equation  (5)  to  have  the  same  number  of 
terms  as  well  as  the  form  of  equation  (4);  this  will  be  seen  to  be  important 
in  the  section  on  Error  Analysis. 

The  first  condition  is:  if  g  -  e,  then  equation  (5)  may  be  written  as 

y  "  k  (R2  -  R3  +  h2)  <5a> 

and  the  second  condition  is:  if  g  -  o,  equation  (5)  will  be 

y  •  Ih  -  R3  ♦  h2>  (5b) 

These  conditions  require  only  that  the  choice  of  station  positions 
constitute  an  orthogonal  array. 


METHOD  II  (VECTOR) 

This  method  is  somewhat  more  complex  in  appearance,  but  for  a  general 
application  of  DME  data  to  acquire  a  solution  for  spatial  position  it  is 
remarkably  simple  when  compared  to  that  required  using  Method  I.  It  also 
gives  a  better  insight  into  the  geometry  requirements  of  trllateration ,  thus 
providing  knowledge  necessary  to  achieve  the  measurement  configuration  for 
best  accuracy  with  a  given  number  of  stations.  It  provides  directly  a  vector 
solution  which  lends  itself  to  "tracking"  problems  very  well.  For  these  and 
other  reasons  which  become  apparent,  this  method  constitutes  the  major  effort 
of  this  report. 

Given  three  noncollinear  stations  F,  M,  and  C  with  three  correspondingly 
measured  ranges  ,  R^,  and  to  any  unknown  single  point,  one  can  derive 

the  position  vector  ?  in  terms  of  two  base  line  vectors  M  and  Z  and  their 
cross-product  as  follows  (Figure  1). 
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The  vector  ?  can  always  be  written  aa  a  linear  combination  of  the  vectors 
M,  5,  and  (M  x  6)  as: 

?  -  aM  +  b£  +  d(M  x  C).  (7) 

where  M  x  C  denotes  the  cross-product  of  M  and  6. 

The  vectors  &  and  Z  are  defined  by  the  geometry  of  the  station  array; 
therefore,  the  problem  is  to  find  the  set  of  scalars  (a,  b,  d) . 


Since 

\  -  1*1. 

and  R2  -  |?  -  M| 

*3  -  I*  -  S| 

squaring  these  gives: 
R?  - 


R2  -  ?•?  -  2?*M  +  M-M 


(8) 

(9) 


R2  »  -  2?-C  +  C-C 


(10) 


If,  in  equation  (9)  we  let  P*M  ■  a  and  equation  (10)  P*C  *  6,  they  can  be 
rewritten  using  equation  (8)  as: 


R2  -  R2  -  2o  +  |M | 2 

and 

R2  -  R2  -  28  +  |£|2 

Solving  for  a  and  8  gives: 

-  4  +  jM|2 


and 


(9a) 


(9b) 


(ID 

(12) 
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product  of  equation  (7)  with  M  gives. 


?*M  »  aM*M  +  b<5*M  +  d(M  x  <S)*M 


; 


l 


1 

I 


or 

o  ■  aj5|2  +  b<$*M. 

Dotting  equation  (7)  again  with  Z  gives. 

-  a3-£  +  b£-C  +  d(M  X  Z)-C 
or 

6  -  aZ  M  +  b]£|2 

Solving  equations  (13)  and  (14)  for  a  and  b  results  in 


and 


a  ?*M 

6  Z-Z 

a  £*M 

0  z-z 

L  qi^i2  -  p&% 

a  - 

M-M  £* 
•+  -►  -► 
C-M  C« 

5| 

•21 

|Mx^|2 

i^i2 

•+  -► 
M-M  a 

2*M  6 

.  M 

k|2  - 

aC*M 

b  »  ■ 

|M!2 

|Sx2| 

2 

final  scalar  is 

found 

from 

?«(M  x  Z) 

-  d|M 

x  Z\2 

(13) 


(14) 


(15) 
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+  b£»?  +  d(i$  x  £)•? 


combining  (15)  and  (16)  as: 

R2  -  aa  +  bB  +  d2jM  x  <5]2 

or  5 

R  -  aa  -  bS 

d  “  |M  x  Z\Y  ’ 

Finally 

(R2  -  aa  -  bB)** 

j  „  - i - - - . 

|m  x  <$ I 

Therefore,  the  expressions  for  the  scalars  are: 

.  .  °|g|2  -  ,  vhsre  »  -  |S*?I. 

y2 

6 (Ml 2  -  q(?’M) 
b  ■  2 

y 

and 

(R?  -  aa  -  bB)5* 

(1  »  '  "  y  - - 

The  position  vector  may  now  be  written  as 

£  .  q|£l2  -  g(?-M)  ^  +  S_1  m  1 2  -  a&gl  t 
mi2  y2 

(R2  -  aa  -  bB)i*(l?  x  6) 
—  $ 


ir, 


By  noting  that: 


X .  5  md  iiiX . ; 


'  *  *•  ■+•  -v  •+. 

Where  i,  j,  k  are  unit  vectors  in  the  M,  c,  (M  x  C)  directions  respectively 
and  using  the  expressions  for  the  dot  and  cross-products.  P  can  be  rewritten 


?  „  (-2 - !L  cose)  Csc2ei  +  (-£-  -  —  Cose)  Csc2ej 

|M|  |cf  |c|  |M| 

+  Cr2  -  ((-—)  +  (-—)  )  Csc2e  +  **  k  ,  (21) 

1  |M|  |S|  f 

where  9  la  the  angle  between  M  and  C. 

Since  the  magnitudes  of  M  and  C,  that  is  |M| ,  |£|,  and  6  are  fixed  for 
a  given  station  geometry,  the  only  quantities  to  be  recurrently  calculated 
are  a  and  0  as  given  by  equations  (11)  and  (12).  For  the  general  case  the 
simplicity  of  this  procedures  as  compared  to  the  intersecting  spheres  method 
is  apparent. 

For  the  special  but  desirable  case  that  i  normal  to  J,  equation  (21), 
reduces  to: 


|m|  \Z\  |m)  |B) 


(21a) 


which  gives  directly,  if  desired,  the  Cartesian  coordinates  in  the  i,  J,  k 
system  as: 

f*i  •  X  •  $‘i  -  y  - 

|S|  |C| 


?*k  ■  z  •  (R2  -  -  -^)*5  . 

1  |S|  |2| 
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The  correlation  between  these  values  and  those  of  equations  (4) ,  (5b) , 
and  (6)  is  obvious. 


»  „In  the  general  case,  the  transform  to  a  chosen  set  of  orthogonal  coordinates, 
i*,  g*.  is  given  by: 


l.e. , 


i*-i  -  au,  i**j 

A  A  A 

j*-i  -  a2l.  J*-j 
k**i  ■  a^, 


al2* 

a22* 

a32‘ 


i*.k 

A  * 

j**k 

k**k 


a13 

a23 

a33 


V* 

n 


3 


l  KntVv  n  -  1,  2, 


3 


(22) 


where  V  are  the  vector  components  of  ?  and  V*  are  Cartesian  coordinates 

x  “  v?»  y  *  v5*  and  2  “  V3*  The  Ani  are  C^e  n^irectl-on  cosines  of  the 
transform.  n 


ERROR  ANALYSIS 


The  error  analysis  of  Method  I  and  Vector  Method  will  be  accomplished 
using  the  propagation  of  error  principle  which  states  for  a  quantity(s)  which 
is  a  function  of  two  or  more  measured  quantities,  i.e.,  s  -  s(p,q,u),  the 
uncertainty  in  s,  6s,  is: 

-  _  r,3s*2,,  .2  ,38.2.*  .2  ,3s.2,*  .2-i4 

6a  l(-^)  +  +  (5u>  -J 

where  <$p,  6q,  and  6u  are  the  uncertainties  in  the  measured  quantities.  This 
expression  assumes  a  symmetric  distribution  of  the  measurement  uncertainties, 
i.e.,  positive  and  negative  errors  are  equally  probably  in  the  measured  ranges. 

Method  I 


Using  equation  (4),  where  x  -  x(R  ,  R.) ,  gives  for  the  partial 
derivatives : 

3x  Ri  3x  R2  .  .  . 

*3R^  “  e~’  "3R~  "  "  and  the  resulting  uncertainty  in  x,  6x,  is: 


«x  -  ^CR^SR^2  +  R2^)2]1*. 


2  2  2  2 

Letting  t12  ■  R^(6Rj)  +  R2(«R2)  this  expression  will  be  rewritten  as: 


6x 


(t12) 
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From  equation  (5),  where  y  -  y(R  ,  Rj,  x)  and  the  resulting  partial  derivatives 


are: 


*1 - 2.  - -1  and 

h  3x  h 


3R2  h  *  3R3 


gives  for  the  uncertainty  in  y,  6y 

(t, 


[(^)2C6RJ2  +  <^)2(6RJ2  +  (Sx)^. 


where  again  6x  • 


giving: 


5y  -  iu23  +  t12(l  -  2(g/e)  +  (g/e)2^. 

1  _  -.Jj 

Thi9  appears  to  be  a  curious  result  since  if  g  ■  0,  6y  »  ^Lt23  +  ’ 

(t23)i4 

but  if  g  ■>  e,  6y  -  — r -  . 


This  result,  however,  indicates  that  for  minimizing  the  uncertainty 
in  any  one  component,  x,  y,  or  z,  that  two  of  the  stations  should  be  aligned 
in  that  direction.  This  is  also  apparent  in  the  subscript  change  on  the  value 
of  R  in  equation  (5b)  from  that  in  equation  (5a) .  Pursuing  this  philosophy 
wouli  result  in  the  addition  of  a  fourth  station  to  optimize  the  uncertainty 
of  a  spatial  position.  The  location  of  this  fourth  station  would  necessarily 
be  directly  above  one  of  the  other  ground-based  stations  so  that  the  line 
formed  by  these  two  would  be  normal  to  the  plane  z  «=  0,  i.e.,  that  formed  by 
the  three  ground-based  stations.  That  is,  of  course,  not  an  absolute  requirement 
but  one  to  optimize,  i.e.,  minimize  the  spatial  uncertainty. 


This  argument  also  implies  that  you  should  expect  the  uncertainty  in 
the  z  component  to  be  the  largest  of  the  three  components  when  using  only 
ground-based  stations.  This  will  be  shown,  in  general,  to  be  true  since: 
z  -  z (Rj ,  x,  y)  from  equation  (6) 


p 

3z  m  _JL  _3_z  _  _3z 

3R^  z  ’  3x  z’  3y 


-y/z  and 


6*  -  ^CR2(61L)2  +  x2(6x)2  +  y2(6y)2A 
z  i  J- 


A  significant  difference  in  this  expression  for  the  uncertainty  in  z  is  the 
fact  that  it  is  inversely  dependent  on  the  value  of  z.  Producing  the  condition 
that  as  z  approaches  zero  (z  -*  0)  the  uncertainty  in  z  approaches  infinity 
(d z  -*■"). 


505 


Minimizing  the  uncertainty  in  z  would  require  small  values  in  both 
x  and  y  reaching  a  limit  when  x  ■  0,  y  ■  0  and  resulting  in 

$ z  •  -Mr.) 

Z  1 

but  for  this  condition  R  -  z;  therefore,  6z  -  Since  this  is  the  smallest 

uncertainty  that  can  be  obtained  in  any  one  component,  6z  can  obviously  range 
from  this  value  to  the  largest  and  will  in  general,  be  greater  than  6x  or  6y. 

Vector  Method 


Using  the  uncertainties  in  a  and  6  the  uncertainty  in  the  scalers  a,  b, 
and  d  can  be  evaluated.  These  then  give  the  uncertainty  in  the  position 
vector  ?,  i.e. ,  6f. 

h  .  h 

From  equations  (11)  and  (12),  the  6a  »  t^  and  6B  ■  ci3’ 
scaler  A  *  a(a,  8)  and  its  partial  derivatives  are:  1 


2  2 
3a  Csc  e  ,  3a  -Cos0  Csc  0 

‘  md  as - 


M 


M 


Since 


or 

6?-i  -  6a | M | 


i.e.,  the  uncertainty  in  the  ith  component  is: 


2  4 

.  Cos  0  Csc  0  _  -H  i3i 

12 +  ;vr" t  n-1 


M 


Inl¬ 


and  finally  rewritten  as: 


6P. 


2  C12 

CscQ(-~  + 

lMl2 


2 

Cos  6t 


ll)^ 


Icj' 


There  is  of  course,  one  very  obvious  difference  in  this  uncertainty,  i.e., 
it  is  a  function  of  the  angles  6.  This  expression  and  the  ones  to  follow  show 
the  strong  dependence  of  the  triad  geometry  on  the  resultant  accuracy  of 
spatial  position  determination  or  position  uncertainty. 
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Again  the  need  for  noncollinear  stations,  since  if  6  •  0  ,  6]^ 
It  is  also  apparent  that  to  minimize  6P^,  0  must  be  equal  to  tt/2. 
Using  the  same  procedure  gives  for  the  uncertainty  in  b, 


«,  .  S°&  t!u  + 

1  Z 


2 

Cos  8t 


M 


1*1  1*1 

and  since 

6?-j  -  <5b|cj j*j  , 

the  uncertainty  in  the  jth  component  of  £  is 

2 

_  t,_  Co 8  et10  , 

-  Csc20(-— _  +  - ^ 

J  \"t\  * 


i*r 


M 


The  final  9caler  d  is  a  little  more  complicated  since 


d  *  d  (Rj,  a,  6) 

The  partial  derivatives  are: 

3d  _  _ R1 

3R1  YfR2  -  aa  -  bg)1* 


and 


3d 

3a 


3d 

36 


-a 


2Y(R2  -  aa  -  be)*5 


-b 


2Y(R2  -  aa  -  b6)*5 


Letting  t  »  R^  -  aa  -  b6,  the  uncertainty  in  d  may  be  written  as 


<5d 


•  ^5  ^<5Ri>2  +  »\2  +  S'J 


Rewriting  this  equation  in  terms  of  a,  6,  and  6  with  consideration  given 
to  the  fact  that 

fi?k  -  (6d)y 
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results  in  the  kth  component  of  ?  to  be 


6?, 


.  _W  )*  +  as!a  {-sL .  +  &»?•>. 

k  2^  1  1  |H|2  |B|2  I«|  |2|  |?|2  12 


lx  0  2  2 

Csc  6  .  B  2gaCos6  a  Cos  8.  -i*s 

UI2  Ul2'  Nisi  |5|2>13J 


This  expression  is  greatly  simplified  if  0  *  n/2.  For  this  desirable 
geometry: 


*pk « -t^r2(5r  r  +  — f  + 

K  2t*  M  C 


2  02 

2  .  a  *12  .  C13  -j>s 


DISCUSSION 


If  the  methodology  of  tracking  systems  move  closer  to  computer  controlled 
pointing  antennas  or  nonpointing  systems  implicit  in  this  report,  software 
methods  will  become  increasingly  critical  with  regard  to  quality  of  solu“‘on. 
The  ultimate  accuracy  of  thi9  solution  is  a  function  of  target  position  th 
respect  to  triad  location  and  geometry  [A].  Although  only  three  stations 
are  required  for  a  position  solution,  in  practice  many  stations  would  be 
deployed.  The  analytical  method  presented  here  permits  the  selection  of 
those  triads  which  optimize  the  accuracy.^  To  illustrate  this  method,  baseline 
magnitudes  were  selected,  i.e.,  |c{  and  | M [ ,  uncertainties  in  the  measured 
ranges  <SR^  were  set,  and  a  spatial  point  (x,  y,  z)  determined. 

The  angle  9  between  M  and  C  was  varied  to  determine  the  effect  of  triad 
geometry  on  accuracy.  Figures  2  through  A  are  plots  of  position  uncertainty 
versus  0,  where  the  units  of  uncertainty  are  the  same  as  those  of  the  base 
lines.  The  uncertainties  in  the  x  and  y  components,  <5i  and  5j  respectively, 
have  the  same  values  in  the  vector  method  and  may  be  indicated  as  either  one 
on  the  plots. 

Observations  and  conclusions  have  been  made  in  the  body  of  this  report 
where  they  seemed  the  most  appropriate  and  will  not  be  repeated. 
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ABSTRACT 


This  study  involved  statistical  efforts  by  a  social  scientist  in  the 
field  of  camouflage  development.  Statistical  techniques  were  used  to  trans¬ 
form  subjective  data  into  objective  results  as  a  basis  of  statistical  infer¬ 
ences.  The  specific  task  was  to  Interpret  and  coordinate  subjective,  verbal 
data  generated  by  94  pairs  of  image  interpreters  viewing  aerial  film  strips 
of  tactically  emplaced  military  equipment.  Target  detections  were  accomplished 
on  film  scaled  1:9,400.  Target  identification,  if  a  correct  detection  was 
made,  was  accomplished  on  film  scaled  1:5,000.  Visual  cues  leading  to  target 
detection  and  identification  were  tabulated.  Using  the  statistical  technique 
of  minimum  contrasts,  visual  cues  for  detection  and  identification  were  objec¬ 
tively  differentiated  at  the  significance  level  of  a  =0.025 


515 


Ittiogpun  BAGS  BU® 


1.0  INTRODUCTION 

The  social  science  techniques,  when  applied  in  a  research  setting  are  . 
becoming  more  valuable  as  requirements  for  quantitative  data  Increase.  This 
is  particularly  true  when  the  raw  data  is  subjective  in  nature.  This  report 
investigated  visual  cues  for  detecting  and  identifying  a  ground-to-air  wea¬ 
pon  system.  The  visual  cues  were  obtained  by  the  social  science  technique  of 
the  open-ended  Interview  of  the  test  subjects.  This  paper  describes  the 
method  of  obtaining  this  subjective  data,  and  the  processing  of  it  into  ob¬ 
jective  definitive  results. 

2.0  TEST  SITE  ANO  EQUIPMENT 

2.1  TEST  SITE 

The  test  site  was  at  Fort  Lewis,  Washington.  The  exart  area  was  referred 
to  as  the  Merill  Drop  Zone.  The  drop  2one  was  approximately  1.75  x  0.65  km  in 
size  and  was  located  in  a  temperate  climate  zone.  The  existing  vegetation  was 
comprised  of  grasses,  shrubs,  and  pine  trees. 

2.2  TEST  EQUIPMENT 

The  test  equipment  consisted  of  a  single  tracked  vehicle.  It  was  camouflage 
pattern  painted  in  a  woodland  (US/Europe)  fall/winter  color  scheme  consisting 
of  approximately  45%  each  Forest  Green  and  Field  Drab,  and  5%  each  Sand  and 
Black  colors. 

3.0  TEST  IMAGERY 

The  test  item  was  tactically  sited  and  photographed,  using  9  Inch  strip 
color,  aerial  film,  at  scales  of  1:5,000  and  1:9,400  leach  with  60%  forward 
overlap.  The  1:9,400  test  strip  contained  15  frames  of  imagery;  the  strip  scaled 
1:5,000  contained  5  frames.  Each  frame  scaled  1:9,400  covered  a  ground  area 
of  approximately  one  square  km.  The  camera  used  was  a  ZE1 SS  RMK-15-23  mounted 
in  an  Aero  Commander  aircraft  under  contract  to  MERADC0M. 

4.0  TEST  PROCEDURES 

The  cut,  strip  imagery  were  given  to  ninety-four  (94)  pairs  of  operational 
image  interpreting  ( 1 1 ' s ) .  The  term  "operational"  is  used  to  indicate  the  sub¬ 
jects  carry  an  II  military  occupational  specialty  code  and  are  assigned  in  II 
positions.  All  had  received  service  training  in  interpretation  methods  and  pro¬ 
cedures.  The  II 's  were  read  a  briefing  in  which  they  were  told  to  look  for 
possible  military  equipment  on  the  strip  of  film  scaled  1:9,400.  Detailed  item 
analysis  would  be  accomplished  on  the  film  scaled  1:5,000.  The  social  scientist 
conducted  an  in-depth  open-ended  interview.  From  each  II  team,  statements 
were  independently  extracted  as  to  the  physical  features  about  the  surrounding 
and  the  target  that  enabled  detection  and  identification.  Their  responses, 
known  as  visual  cues,  were  then  tabulated  to  form  a  frequency  distribution. 

These  visual  cues,  subjective  in  pature,  were  then  statistically  analyzed  using 
the  method  of  minimum  contrast.  ^,3By  employing  this  method,  the  visual  cues 
were  objectively  ranked  (  ct =0.025)  as  to  first  order,  second  order,  etc.  This 
data  is  presented  in  the  next  section. 

5.0  RESULTS 

The  refined  visual  cues  for  target  detection  and  identification  are  found  in 
tables  one  and  two  respectively. 
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TABLE  1 


Significant  Differences  Between  Visual  Cues  For  Target  Detection. 


A 

B 

c 

D 

E 

F 

G 

H 

1 

0 

K 

Frequency 

A 

83 

B 

XX 

71 

C 

XX 

XX 

38 

D 

XX 

XX 

37 

E 

XX 

XX 

XX 

XX 

21 

F 

XX 

XX 

XX 

XX 

12 

G 

XX 

XX 

XX 

XX 

XX 

7 

H 

XX 

XX 

XX 

XX 

XX 

XX 

2 

I 

XX 

XX 

XX 

XX 

XX 

XX 

1 

J 

XX 

XX 

XX 

XX 

XX 

XX 

1 

K 

XX 

XX 

XX 

XX 

XX 

XX 

1 

Cell  Size  92  (Two  II  teams  did  not  detect  the  target)  xx-Significant  a=  0.025. 

A-Target  appears  geometric  (rectangular) 

B-Target  appears  lighter  in  color 

C-Track  activity  seen 

D-Target  is  set  In  the  open 

E-Shadow  seen 

F-Reflection  seen 

6-Turret  seen 

H-Target  has  height 

I -Gun  barrel  seen 

J-Target  placed  next  to  road 

K-Target  appears  smooth  In  texture 

Conclude  that  the  visual  cues  are  ordered  as  follows: 

First  Order  -  A 

Second  Order  -  B 

Third  Order  -  C  and  D 

Fourth  Order  -  E  and  F 

Fifth  Order  -  6,  H,  I,  J,  and  K 
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ii-  j 


TABLE  2 


Significant  Difference  Between  Visual  Cues  for  Target  Identification. 


A 

B 

c 

D 

E 

F 

G 

H 

I 

J  K  l  M  N  0 

Frequenc 

A 

61 

B 

XX 

46 

C 

XX 

42 

D 

XX 

33 

E 

XX 

XX 

32 

F 

XX 

XX 

XX 

25 

G 

XX 

XX 

XX 

24 

H 

XX 

XX 

XX 

XX 

XX 

16 

1 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

12 

J 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

8 

K 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

8 

L 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

5 

M 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

2 

N 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

2 

0 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

XX  XX 

1 

Cell  Si2e  77  (Fifteen  II  teams  that  made  a  correct  detection  did  not  identify 
the  target)  xx-Significant  a=0-025. 

A-Primary  lower  unit  and  Environment  Control  unit  (with  space  between)  seen 
B-Headl ight  covers  seen 
C-Surevi llance  radar  seen 

D-Front  Part  of  target  boat  shaped  (semi-pointed) 

E-Missile  launchers  seen 

F-Target  has  a  stepped  front 

6-Target  has  a  length-to-width  ratio 

H-Tracking  radar  seen 

I-Target  has  a  cluttered  turret 

J-Target  has  a  height-to-width  ratio 

K-Turret  position  is  toward  the  back  of  the  target 

L-Sloped  front  end 

M-Target  has  flat  top 

N-Driver  hatch  seen 

0-Side  edge  seen 

Conclude  that  the  visual  cues  are  ordered  as  follows: 

First  Order  -  A 

Second  Order  -  B  and  C 

Third  Order  -  D,  E,  F,  and  G 

Fourth  Order  -  H  and  I 

Fifth  Order  -  J,  K,  L,  M,  N,  and  0 
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6.0  DISCUSSION 

Tables  one  and  two  of  the  result  section  have  shown  how  social  science 
techniques  have  transferred  a  hodgepodge  of  verbal  responses  into  a  quantita¬ 
tively  ranked  (  a*0.025)  order  of  visual  cues  for  target  detection  and  iden¬ 
tification.  The  camoufleur  now  has  an  statistical  base  from  which  he  can 
make  scientific  decisions  as  to  where  to  address  his  efforts.  The  addition 
of  disrupters  and  or  nets  to  the  test  equipment  would  effect  the  first  order 
visual  cue  for  both  detection  and  identification.  The  processing  of  the 
subjective  data  into  objective  decision  making  data  will  save  much  time  and 
mone^'.  This  will  be  accomplished  by  identifying  type  and  position  placement 
of  the),  prototype  camouflage. 

iV  '  \  ,*  i  .• 

7.0  SUMMARY 

Color  aerial  imagery  containing  a  ground-to-air  military  weapon  system  were 
given  to  94  pairs  of  II  to  determine  visual  cues  for  detection  and  identification. 
These  subjective,  verbal,  responses  were  obtained  by  a  social  scientist.  Through 
the  application  of  social  science  techniques,  the  subjective  data  was  transformed 
into  objective  decision  making  data  for  camouflage  requirements.  The  application 
of  social  science  techniques  has  therefore  served  as  the  catalyst  for  obtaining 
objectivity  from  subjectivity. 
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SOME  ASPECTS  OF  ENGINEERING  TIME  SERIES  ANALYSIS 


Victor  Solo 

Department  of  Statistics .  Harvard  University 
Cambridge,  Massachusetts  02138 

0.  Introduction 

There  are  a  number  of  characteristics  of  the  type  of  time  series 
problems  met  by  civil,  mechanical,  electrical  engineers  that  distinguish 
them  from  say  those  of  econometrics,  business.  Firstly,  there  Is  usually 
much  more  data  available  -  the  engineer  has  great  control  over  the  choice 
of  sampling  interval.  Secondly,  there  is  often  an  Interest  in  calculations 
performed  In  real  time  (rather  than  at  leisure).  This  Is  especially  so  In 
adaptive  control  and  forecasting  (viz.  of  electric  power  demand).  Thirdly, 
since  engineers  deal  with  physical  processes  many  engineering  time  series 
are  distributed  (spatially)  viz.  the  determination  of  thermal  conductivity 
by  measurement  cf  temperature  In  a  conducting  solid.  There  has,  however, 
been  recent  interest,  in  spatial  time  series  in  geography.  A  fourth  area 
concerns  the  engineering  Interest  In  transfer  function  relations  between 
"Input"  ana  "output"  series  a?  uDpcsed  to  analyzing  the  structure  of 
"noise"  processes.  Of  course,  econometrics  has  a  similar  Interest.  Finally, 
since  classical  control  theory  makes  r  great  use  of  spectral  methods  (gain 
ana  phase  plots)  these  considerations  are  always  In  evidence  even  In  time 
domain  studies.  In  hydrology,  economics,  business  time  domain  methods  nave 
recently  been  more  popular.  The  aim  of  the  present  discussion  Is  to  con¬ 
sider  a  fundamental  problem  In  time  series  analysis,  namely  the  forecasting 
problem.  In  particular,  It  Is  pointed  out  how  a  very  efficient  algorithm 
(known  In  the  control  engineering  literature)  for  computing  exact  finite 
data  linear  forecasts  for  an  ARMA  time  series  model  has  been  passed  by  In 
the  statistical  times  series  literature.  This  algorithm  also  gives  an 
efficient  method  for  computing  the  exact  likelihood. 
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1 .  The  Linear  Least  Squares  Filter 


Consider  the  estimation  of  one  process  x(t)  from  measurements  on 
another  related  process  y(t)  by  measurements  at  times  tj,  giving 

y( t ^ ) .  ...,y(tN)  which  we  collect  into  a  vector  y  or  Y.  Also  write  y(t^) 
x.  *  x(t).  We  look  at  linear  estimates  of  the  form 

■»  V 


it 


N 

£  H  yi  • 

i  V1 


The  linear  least  squares  filter  (l.l.s.f.)  chooses  FL  to  ensure 
T  “  1 
H(xt  -  xfc)  (xt  -  xt)  is  minimized.  This  best  estimate  is  defined  by  the 

orthogonality  condition 

E(xt  -xt)y^  =  0,  s  =  1,  ...» N  . 

* 

Proof.  Let  xt  be  any  other  linear  estimate.  Then 


'  E^t -it* 

*  E||xt-xt|2  ♦  EI;t-x"r  +  2E(xt-xt)'(xt-xj) 


*9 


II 2  ,  r  Ilf!  I|2  .or-/..  i  \T/ 


The  third  term  vanishes  by  orthogonality.  Thus,  the  mean  square  error 

★  /s 

of  xt  is  larger  than  that  of  x^. 

Remark.  If  we  allow  nonlinear  functions  of  the  past  then  jL  =  E(x.Jy)  so 
we  often  write  the  linear  estimate  as 

?t  =  ^2tl^ 

a  wide  sense  conditional  expectation.  (Note  E(xt|y)  is  defined  too  by  an 
orthogonality.)  'Irr-fact,  xt  -  E(xt|y)  Is  orthogonal  to  any  linear  or  non¬ 
linear  combination  of  the  past. 
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We  can  apply  the  orthogonality  condition  to  solve  our  problem  thus. 


substituting  for  xfc  Implies 

N 

E (xt  -  I  HtiYi)Ys  =  0,  s  =  1,  . . . ,  N 
or 

E(xty*)  _  ~tE^^'  ^  *  5 
or 

Ht  «  E(xty,)E(yy')"1  . 

Thus 

it =  =  E^t^')E^')"1y  * 


The  problem  with  this  form  of  the  solution  is  that  it  involves  a  matrix 
inversion  so  we  are  lead  to  a  second  approach.  First  process  the  data  y 
to  whiten  it  i.e.  uncorrelate  it  then  the  inversion  is  easy,  since  the 
matrix  is  diagonal. 

We  perform  a  Cholesky  factorization  of  E(yy’)  =  UDUT  where  U  is  upper 
diagonal  with  I's  on  the  leading  diagonal.  D  is  block  diagonal.  Consider 


Then 


“1  1  ' 

v  =  U  y  =  (v, . vN)  . 

e(wt)  =  y-Vy-1  =  d 


i.e.  EKvL  =  0,  i  f  j,  i,  j  =  1,  ...,M  i.e.  the  v.  sequence  is  a  white 
sequence.  Since  ll"^  is  lower  triangular  the  are  linearly,  causally, 

invertibly  related  to  the  y^  The  is  called  the  linear  inovations 
sequence. 
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Because  of  the  equivalence  between  v,  y  the  orthogonality  condition 
can  be  rewritten 


Then  as  before 


or 


also 


A  I 

E(xt  -xt)v$  *  0,  s  =  1 . N  . 

x.  *  E(xjv)  *  E(x.v')E(vvT)"1v 

t  *■*  L  *•  •* 

A.  ti  '  _1 

5t  *  |  ^ 

St|N  *  Jt|N-l  *  E(JtsA'sN  • 


Taking  variances  gives 

*%-?t|Nll  =  ~t|N  =  ~t|N-l  "  E^~t~N^RN  E^5t^  * 


Remark. 


=  ~N  ‘  ~N|N-1 


e(yn  '  (!h"Xn,,vs  =  e^n~s}  =  5’  5  =  1 . .  • 


It  follows  then  by  the  uniqueness  of  v$  that  y^  -  =  Xn | N-l * 

To  complete  the  algorithm  we  need  recursive  formulae  for  the  Cholesky 
factoring.  Now 


■  it  -  it  t-i -  it  -  *f  E<v$>* 


s  s 


so 

E<ViV 


t-1 


«wt>-  f  E<vivs>R;'E(Vt>-  t  =  2* • 


E<Wl>  ■  E<¥~.<i>  • 


m+r  1 
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J  X  > 


Also, 


*  E<vl>  ■  E<V»)  • 

Me  will  return  to  this  algorithm  below. 

2.  Relation  to  Wiener  Filtering 

Take  (i)  xt>  yt  jointly  stationary  (so  t^  are  equispaced). 

(ii)  Suppose  an  "infinite  past"  of  Y  data  is  available.  We  want 
k|_w  =  E(x^ •••  )•  Again  we  first  whiten  the  data  yk.  Suppose  yk 
has  spectrum 

$yY(Z)  •  l  E(YkYQ)Zk 
.00 

■  N«vi»  • 

Here  Z{*}  denotes  the  Z  transform.  If  <PYY (Z )  Is  nonsingular  on  |Z|  *  1 

it  has  a  factoring  <pYY(Z)  =  W(Z)W(Z”^)  where  W(Z),  W  ^(Z)  are  analytic 

on  | Z |  i  1  and  lim  W(Z)  <  ®,  W(Z)  has  no  Z”1  powers.  (This  is  an  "infinite" 
Z-*» 

analogue  of  the  finite  data  Cholesky  factor.) 

Consider  now  vk  =  W~^(Z)Yk.  Then 

z«(Vo»  =  *w(z) 

•  l  W“1(Z)E(YkYQ)w'1(Z'1)Zk 

—  CD 

=  W'1(Z)0>Yy(Z)W"1(Z'1)  =  I  . 

So  is  a  white  noise  sequence  of  variance  I  and  vk  are  causally  linearly 
equivalent  to  Yk.  So  we  must  be  able  to  calculate  xk|k_-|  as 
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*k|k-l  =  E(xklvkvk-1 

=  E()ckvk)vk  +  E(xkvk_1)vk_1  +  ... 

=  E(xovoK *  E{xov-i)vk-i + 

■  l  °f\  '  Mr’K 

where  =  E(xqV_j) 

0(Z_1)  =  l  d.I j  =  I  D.Zj  +  1  O.Zj 
-00  J  0  J  -00  J 

=  D+(Z)  +  D_(Z) 

Thus  the  transform  from  to  xk|k_.j  is  D+(Z"^).  So  the  transform  linking 
*k  t0  *k|k-l  is 

*k|k-i  ■  MZ'S  ■  o.(z''>M'’(z>yk  • 

Finally  observe 

D(Z)  =  I  E(x0v _.)Z‘j 
•00  ^ 

=  I  E(x0W"1(Z)y  )Z‘j 
-00  V  J 

•  W'1(Z)4»xy(Z)  . 

Thus 

Xk|k-1  =  ffiforL  wITTyk* 

which  is  the  well  known  formula  for  the  Wiener  filter. 
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3.  Exact  Likelihoods  Via  the  Linear  Least  Squares  Filter 


Under  a  Gaussian  assumption  on  the  data  y  the  likelihood  or  joint 
density  of  the  y  data  is 

In  L  =  constant  -  jin  ] R (  -  ^  yTR"V 

where  R  *  E(yy^) 

J  N  P  1  N  A  A 
=  constant  -  ^  I  In  -  j  l  ^\f°\  • 

T 

This  follows  since  R  =  UDU  implies 

y'R-1y  =  yVVVV  =  vTp_1v 
and 

|R|  »  I u | 1 D | | uT !  =  |DI  =  n  o\  . 

Alternatively,  the  second  expression  follows  by  writing  L  as  an  Iterated 
conditional  density. 

So  to  get  the  exact  llkelihod  we  need  only  generate  the  v^.  Consider 
this  process  for  the  ARMA  model 


Yn  +  a,Y  ,  +  . . .  +  a  Y  =  e  +  c,c__1  +  ...  +  c  e_  „ 
n  i  n-i  n,  n-n  n  l  n-l  n„  n-n. 

a  a  aa 


with  initial  conditions  chosen  to  ensure  y  is  stationary.  Observe 


Next 


£<vtYn>  *  *  "I  asE<Yt-syn)*  t.n  +  r.r.n,tl.  (1) 
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However, 


u - • 


■  \  ■ 


v  5  I  «  Y„  for  some  a__ 
t  ^  -rn  n  tn 

so  If  t  >  t  +  r  so  that  t  >.  n  +  r  then  using  (2)  In  (1)  will  give 


E(V0vt)  *  -  I  »,E(Yt.$vt),  t  >r* 


So  write 


t-r-1 


i  t- 1  i 

~ i .  .  r  c/u,.  \o  1 . 


yt|t-i  •  5  e<Vt>«;'\  *  l  E<W  ^ 


by  (3) 


t-r-1  r-1 


Z  ( •  Z  «se<Vs"t»r' V  \l  e(Vt)r  vt 


•  :Zl  ./f  ‘  X  E(Vt''t,l’x'Vl 

•  -I’  >,[V  E(Y  v JR-'v  ]  *  ‘i’  [E(Y  v  )  *  T  asE(Yt_  vt)}r;’vt 

1  5  1  1 s  T  T  t-r  1  1 

>t|t-l  ’  -'f  Vt-S  *  X  E(Vi>"t’vt 

»Mr.  wt  -  yt  ♦  rj'  asyt.s  .  Also,  «,  ■  E(Vt)  .  «Vt>-  Me  c,n  9enf,n,t*  t0°' 
a  finite  recursive  algorithm  for  E(w^Vy).  For  details  (and  an  alternate 
derivation  of  the  above  filter)  see  Kallath  and  Aasnes  (1974).  This 
reference  also  discusses  the  p  step  ahead  forecasts,  p  >  1. 
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4.  Exact  Likelihood  for  Continuous  Discrete  Models 
Suppose 

's’  +  +  agS  ■  w  +  aw  +  djW 

where  s  *  ds/dt  etc,,  s  ■  s(t)  etc.,  y  *  s  +  v  and  w,  v  are  "white" 
noises  and  y  Is  sampled  at  times  tj,  ...,tfJ.  The  log  likelihood  Is 
as  before 

,  N  2  j  N  2  ? 

log  likelihood  =  const.  -  j  £  In  ^  ^  v^/a^ 

To  generate  the  observe 

1.  s(t|N)  •  s(t|N  -  1)  +  E(s(t)^)R‘1vN.  So 
SN|N  *  SN|N-1  + 

2.  s(t|N  -  1 )  +  a,  ( 1 1 N  -  1 )  +  a2s(t|N-l) 

dt 

N-l  .  ,1 

»  £  E[(g  +  a1s  +  a2s)vk]Rk  vk 

8  0  ’  lN-l  <  1  <  *N 
with  initial  condition 

3.  E(s(t)vN)  *  Es(t)[{y(tN)  -  s(tN|N-l)] 

*  E[s(t)s(tN)]  -  P(t,  tN i N  -  1 ) 

P(t,  tN  |  N  -  1 )  =  E[s(t|N-  l)s(tN|N-  1)] 
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4.  P ( N | N )  »  P(tN,  tN|N) 

*  P(N | N-l )  -  E(S(N)v^)  R*1  E(vnS|||) 

5.  Derive  a  differential  equation  for  P(t »t | N-l )  for  t^  <t<t^  by 
differentiating 

P(t,t|N-l)  =  E  ||s(t|N-l)  ||  2  =  Y  E(s(t)vN)  R'1  E(vNs(t)) 
to  yield 

A 

Acr>  dP  ^ 

^7  +  al  dt  +  a2  P  =  0 •  V 

Further  details  of  this  procedure  will  be  discussed  by  the  author 
elsewhere;  however,  compare  this  with  the  usual  approach  where 

(i)  data  must  be  equi spaced; 

(ii)  an  equivalent  discrete  model  is  formed  and  a  discrete  likelihood 
computed  so  that  the  original  parameters  occur  very  nonllnearly. 

5.  Conclusion 

The  use  of  a  recursive  form  of  the  linear  least  squares  filter  for 
estimating  one  time  series  from  another  has  been  Illustrated  In  two  cases# 
This  Idea  has  promising  use  In  the  statistical  solution  of  Inverse  problems 
where  In  the  signal  plus  noise  model  the  signal  is  the  solution  to  an 
Integral  equation. 

References: 

1.  H.  Aasnes  and  T.  Kallath  (1974),  "Initial  Condition  Robustness  of 

Linear  Least  Squares  Filtering  Algorithms,  IEEE  Trans.  Auto.  Control. 
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STRESS-STRENGTH  MODELS  FOR  RELIABILITY: 
OVERVIEW  AND  RECENT  ADVANCES 


G.  K.  Bhattacharyya 
and 

Richard  A.  Johnson 
University  of  Wisconsin,  Madison 

ABSTRACT 

A  stress-strength  model  of  reliability  Is  relevant  for  the  situation 
where  random  environmental  stresses  tend  to  Interfere  with  the  functioning  of 
a  device.  Interest  In  these  models  abound  in  numerous  disciplines  of  engineering 
and  life  sciences.  .  In  this  article,  we  present  an  overview  of  the  principal 
aspects  of  these  models  and  our  contributions  to  the  recent  advances  in 
statistical  analyses  of  the  stress  and  strength  data. 

Research  supported  by  Office  of  Naval  Research 
Grant  N00014-78-C-0722. 


1.  INTRODUCTION 


A  stress-strength  model  of  reliability  is  relevant  for  the  situation 
where  random  environmental  stresses  tend  to  Interfere  with  the  functioning  of 
a  device.  Interest  in  these  models  abound  in  numerous  disciplines  of  engineering 
and  life  sciences.  In  this  article,  we  present  an  overview  of  the  principal 
aspects  of  these  models  and  our  contributions  to  the  recent  advances  in 
statistical  analyses  of  the  stress  and  strength  data. 

Most  engineered  products  must  operate  in  environments  which  are  not 
controlled  and  the  possibility  of  random  shocks  usually  prohibits  any  strictly 
deterministic  formulation  of  the  stresses.  Stress-strength  models  use 
random  variables  to  represent  both 

(i)  the  variation  in  the  ability  (strength)  to  perform 
and 

(ii)  the  variation  in  the  stress  imposed  by  the 
environment. 

Let 

X  =  maximum  stress 

Y  »  strength  of  unit. 

In  this  context,  we  define  Reliability  (R)  =  the  probability  that  the  unit 
performs  its  task  satisfactorily,  that  is,  the  unit  is  strong  enough  to 
overcome  the  stress. 

R  =  PfStrength  >  stress]  =  P[Y>X] 


When  X  and 
R  =  P[Y>xl  = 


Y 


f 

J 


are  independent, 
F(y)dG(y)  =  1  -  [ 


CD 


G(x)df (x) 


-CD 
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(1.1) 


j. 


where  F  and  G  are  the  continuous  cumulative  distribution  functions  (cdf's) 
of  X  and  Y,  respectively. 

The  following  examples  help  to  delineate  the  diversity  of  applications. 

Example  1 .  [Rocket  engines]  Let  X  represent  the  maximum  chamber 
pressure  generated  by  ignition  of  a  solid  propellant,  and  Y  be  the  strength  of 
the  rocket  chamber.  Then  R  is  the  probability  of  a  successful  firing  of  the 
engine. 


Example  2.  [Comparing  two  treatments]  A  standard  design  for  the 
comparison  of  two  drugs  is  to  assign  Drug  A  to  one  group  of  subjects  and 
Drug  B  to  another  group.  Denote  by  X  and  Y  the  remission  times  with 
Drug  A  and  B,  respectively.  Inferences  about  R  *  P[Y>X]  based  on  the 

remission  times  data  X^,X2» - Xn  and  ,Y2, . . . ,Ym,  are  of  primary 

Interest  to  the  experimenter.  Although  the  name  'stress-strength'  is  not. 
appropriate  in  the  present  context,  our  target  of  inference  is  the  parameter 
R  which  has  the  same  structure  as  in  Example  1. 


Example  3.  [Threshold  response  model]  A  unit,  say  a  receptor  in  the 
human  eye,  operate;  only  if  It  is  stimulated  by  a  source  whose  random  magnitude, 
Y,  Is  greater  than  a  (random)  lower  threshold  for  the  unit.  Here 

P[Y>X]  *  P[unit  operates] 

is  again  of  the  form  described  above  in  stress-strength  context. 


2. 


ESTIMATION  OF  RELIABILITY  PfY  >  X] 


We  briefly  review  estimation  of  R  when  samples  are  available  from 
both  the  strength  and  stress  distributions.  Specifically,  let  X^Xg,...^ 
be  a  random  sample  from  F  independent  of  Yj  .Y^, . . . ,Ym,  a  random  sample 
from  G . 

2.1  Nonparametric  approach 

A  number  of  authors  Birnbaum  (1956)  Birnbaum  and  McCarty  (1958) 

Owen,  Craswell  and  Hansen  (1964)  have  proposed  nonparametric  estimators  of 
R  =  P[Y>X].  An  estimator,  based  on  the  count  is 

U  =  #  pairs  (X^.Yj)  with  Yj>Xj, 

Fn(y)dGm(y)  (2.1) 

where  F  ,  G  are  the  empirical  cdf’s  from  the  X..  and  Y..  ,  respectively, 
n  m  i  s  j  s 

Approximate  confidence  bounds  can  be  obtained  from  the  large  sample  distribution 
of  R  (see  also  Govindarajulu  (1968),  Sen  (1967)). 

A  difficulty  with  the  nonparametric  approach 

With  small  or  even  moderate  sample  sizes,  high  reliability  cannot  be 
verified.  For  instances,  with  m  *  n  =  11,  if  the  strength  observations  were  al 
larger  than  the  stresses,  we  obtain  .77  <  R  with  confidence  .95.  The  same 
confidence  bound  is  obtained  when  the  strength  measurements  are  only  moderately 
larger  than  the  stresses  (Figure  1(a))  as  when  they  are  considerably  larger 
(Figure  1(b)).  The  nonparametric  method  fails  to  discriminate  between  the 
two  situations. 
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Figure  Two  situations  giving  identical  confidence  bounds  with  the 
nonparametric  method. 
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2.2 

When 


Parametric  approaches 

2  2 
X  is  N(w^  ,o^ )  and  Y  is  N(v2,o2) 

iu-w, 

R  =  P[V>X]  =  ♦(  '  f  T/?) 

(o1  +o2  ; 


(2.2) 


where  C-  denotes  the  cdf  of  N(0,1).'  Church  and  Harris  (1970)  obtain  large 
sample  confidence  bounds  when  the  parameters  of  the  stress  distribution  are  known 


(see  also  Mazumdar  (1970)). 


When  a ^  ,  confidence  bounds  can  be  obtained  from  the  non-central 


t-distribution  of 


?  -  X 


//I  +  Is  (m'1)Sl2  *  (n'1)s22  ,1/2 

" *m  n  m  +  n  -  2 


as  in  Owen,  Craswell,  and  Hansen  (1964). 

Difficulty  with  the  parametric  model 

If  a  small  fraction  of  the  population  of  the  units  contain  major  defects 
of  material  or  workmanship,  a  small  or  moderate  sample  of  strengths  will  not 
show  these  ’rare’  sources  of  failure.  This  is  illustrated  in  Figure  2. 

STRENGTH 
DISTRIBUTION 
FOR  MOST 


Figure  2.  A  strength  distribution  that  is  a  mixture  of  two  failure  sources. 


In  this  situation,  use  of  an  assumed  parametric  form  for  the  stress 
distribution  will,  typically,  lead  to  estimates  of  P[Y>X]  which  are 
incorrectly  very  high. 

Even  without  such  extreme  departures  from  the  postulated  models,  tail 

areas  remain  very  difficult  to  estimate.  The  choice  between  normal, 

•  * 

Weibull  or  lognormal  tails  can  change  the  estimated  reliability  by  several 
orders  of  magnitude  (when  R  is  very  high). 


2.3  Bayesian  approach 

Enis  and  Geisser  (1971)  consider  estimation  of  P[Y>X]  when  X,  Y 
are  negative  exponential,  and  also  in  several  normal  theory  situations. 


3.  STRESS-STRENGTH  MODELS  WITH  COVARIATES 

Recently,  we  have  encountered  stress-strength  analysis  where  covariates 
play  an  important  role. 
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Example  5.  Let  Y  be  the  strength  (In  p.s.i.)  of  a  glass  fiber 
reinforced  rocket  motor  case,  and  let  X  denote  the  operating  pressure. 

The  data  of  figure  4  suggest  that  the  stress  depends  on  ambient  temperature 

z  according  to  a  linear  model 


X  =  +  +  e! 


where  e^  is  N(0,o^  ). 

For  a  given  temperature  z, 


'  al  "  612  X 

P[Y>X|Z]  =  ♦(— 2 - 2TT72  } 

(a,  +  o,  ) 


(3.2) 


□OUGLAS-FIR  2x4 


4t>  *r  r* 


Mr  ^  * 

Vr 


STIFFNESS 


Stiffness  of  wood  as  a  covariate. 


f.xaiw>UJi.  Comparison  of  Drug  A  and  Drug  B  when  ege  is  a  ccvariate 

X  -  remission  time  with  Drug  A 

Y  =  r. r>ii  is  si  on  time  with  Drug  B 

Z  ~  oor 

Data  structure: 

V  1 

x'  I 

7  1 

--11 


can  be  used  to  timate  the  linear  regressions 

E(X|z)  =  ^  +  '.yL  and  E ( Y ! z  )  -  at^  +  e^z  * 

;:0r  a  new  subject  of  age  z  ,  we  may  wish  to  provide  information  a!"  " 


Pi’  Y  X !  Z  ]  . 
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In  Example  6,  if  we  assume  that  X  and  Y  are  independent  normal  random 
variables  having  the  same  variance  o  ,  we  obtain  the  expression 


P[Y>X|z]  *  P[- 


-X-(a2-a1  )-(62-ei  )z  -(a^-a^  )z 


7T 


7 T 


] 


(ou-a ,)+(&'-&.)  z 


(3.3) 


The  resulting  reliability  is  similar  in  form  to  the  previous  cases. 

Allowing  both  X  and  Y  to  depend  on  the  covariate  does  not  lead  to  further 
complexities. 

Under  the  assumptions 

X|z1  -  N(a1  +  6^z1 ,  a2) 

Y]z2  -  N(a2  +  &2Z2'  °2^  * 

We  are  interested  in  making  inferences  about  the  reliability 


P[Y>X 


zyz2 3  *  pt 


Y"°2”^2Z2 

7To 


n 


-] 


at  » z 2 • 


Classical  Approach 

Confidence  bounds  for  R(Zq).  In  order  to  determine  a  lower  confidence 
bound  for  R(zQ),  we  note  that  Y  -  %  -  §(zQ- z)  is  normally  distributed  with 
mean  =  (Zg-z)  and  standard  deviation  ocQ,  where 


c 


2 

0 


m 


n 


(y*)2 

lU.-z)2 

j  J 
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consider 


which  Is  distributed  as  a  non-central  t  with  m+n-3  d.f.  and  the  non 
centrality  parameter 


(v2  *  a  -  S(p-2))/o 

rl  +  1  +  1&-.*)2  ]V2 
m  n 


The  numerator  of  n  is  constrained  by  the  relation 


a  - 

7F 


8(p-z) 

- )  = 

o 


.99 


The  confidence  region  for  p  consists  of  all  values  of  pg  for 
which  Hg'.p  s  Pg-  would  not  be  rejected  In  favor  of  H^-.p  >  pQ. 

The  95%  upper  bound  for  p  Is  the  largest  p  satisfying 


t(p)  *  t  05(p) 
or 

Fp<T<P„bs> )  *  05  • 


For  the  data  of  Example  5,  a  computer  calculation  gives 

p  »  46.8'  . 


4.  STRESS-STRENGTH  MODELS  FOR  SYSTEM  RELIABILITY 

The  formulation  of  a  stress-strength  reliability  model  has  been  extended 
to  multicomponent  systems  and  several  problems  of  statistical  Inferences  on 
the  system  reliability  are  addressed  In  Bhattacharyya  and  Johnson  (1974, 

1975,  1977).  Here  we  only  Include  a  brief  description  of  the  models  and 
the  approach  to  Inferences.  Details  of  the  technical  results  are  available 
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in  the  cited  references. 


Model  1 

The  primary  extension  of  a  single  component  stress-strength  model  was 
focused  on  an  s  out  of  k  system  of  identical  components.  This  is  a 
system  of  k  components  whose  strengths  Y^,...,Y^  are  independent  and 
Identically  distributed  (lid)  random  variables  with  a  continuous  cdf  G(x). 

The  system  successfully  performs  its  mission  if  at  least  s  (1  s  s  s  k) 
components  are  operative. 

First  we  consider  the  situation  where  all  k  components  of  the  system 
encounter  a  conwon  random  stress  X  whose  cdf  is  denoted  by  F(x).  The 
reliability  R  .  of  this  system  is  then  given  by 

Rs  k  *  P[at  least  s  of  Y1 . >  X] 

k 

»  I  ( 

Jt=s 

=  1  -  f  B[G(x)]dF (x)  .  (4.1) 

e  C.] 

where  B  is  the  beta  distribution  with  pdf  * u  (1  -u) 


*)  f  [l*G(x)3  G(x)dF (x) 


A  parametric  approach.  Assuming  the  exponential  distributions 


F(x)  *  1  -  exp  (-0^x) 

G(x)  =  1  -  exp  (-02x)  ,  X  ■  6^/02  » 


the  system  reliability  is  given  by 


1 


B(s+A,k-$+l)  k! 

BTsS-i+rr  ‘ 


i 

_ 

n  (x+j) 

jrs 


(4.2) 
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and  exact 


The  uniformly  minimum  variance  unbiased  (UMVU)  estimator  of  R  v 
confidence  bounds  are  derived  in  Bhattacharyya  and  Johnson  (1974).  This 
work  also  includes  a  study  of  the  bias  and  mean  squared  error  of  the  maximum 
likelihood  estimator. 


Nonoarametric  estimation.  Let  and  Y^,...,Yn  denote 

independent  random  samples  from  F  and  6,  respectively.  The  corresponding 
empirical  cdf's  are  denoted  by 


(x) 


#X .  s  x  «  /  \  _  Hi  s  x 

*  »  «n  (a)  "  _ J _ 

n1  n2  n2 


A  plausible  estimator  R 
and  6  in  (4.1)  by  F 

nl 

this  leads  to 


of  R  can  then  be  obtained  by  replacing  F 

S  » K 

and  6  ,  respectively.  After  some  simplifications, 

n2 


l  '  6(^)1(su> ' 1 


(4.3) 


where  <  ••• 
combined  sample, 
estimator  R  of 


<  St  \  are  the  ordered  ranks  of  the  Y-values  in  the 

Employing  the  idea  of  a  generalized  U-statlstic,  the  UMVU 

R  .  is  derived,  and  it  is  given  by 
s ,  x 


R 


1 


n5> 

ni  (  k ) 


n2-s+l 

l 

l*k-s+l 


[£-l 

lk-s 


)( 


n-t 

s-1 


)(s 


(4.4) 


The  statistical  properties  of  these  estimators,  and  large  sample  confidence 
intervals  for  the  system  reliability,  R,  .  ,  are  explored  in  Bhattacharyya  and 

J 

Johnson  (1974,  1975,  1977). 
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A  few  other  models,  considered  In  Bhattacharyya  and  Johnson  (1974) 
bare  mention. 

Model  2.  s  out  of  k  system  with  standbys. 

,  Operating. 

independent  components  with  strength  distribution  6^ 

Standbys. 

k^  a  k  -  k^  independent  components  with  strength  distribution  6^ 


A  single  stress  X  is  applied  to  all  components. 


Model  3.  Subsystems  with  independent  stresses. 

Subsystem  q. 

kq  independent  components  with  strength  distribution  6^. 

Stress  with  cdf  Is  applied  to  each  component. 

Subsystem  operates  if  s^  out  of  k^  components  operate. 

Model  4.  [Binomial  Counts]  Here  we  consider  the  structure  of  Model  1  with 
the  variation  that  the  test  conditions  do  not  permit  independent  sets  of 
strength  and  stress  measurements.  Rather,  the  prototype  components  are 
tested  under  random  stress  conditions  that  prevail,  and  all  that  is  recorded 


5A6 


are  z^ ,  i  =  1 »  • . • 


n  where 


1,  if  component  i  functions 
0.  if  component  i  fails. 


In  the  case  of  a  system  nith  c  subsystems  where  each  subsystem 
conforms  to  the  structure  of  a  single-component  stress-strength  model*  the 
problem  reduces  to  one  of  estimating  the  system  reliability  from  the  binomial 
counts.  See  Myhre  and  Saunders  (1968),  Madansky  (1965)  and  Easterling  (1972) 
for  discussions  about  inference  methods.  However,  certain  complications  arise 
when  groups  of  components  are  simultaneously  tested,  each  under  a  random 
stress,  and  the  group  size  is  different  from  the  system  (or  subsystem)  size. 

In  the  context  of  Model  1,  Bhattacharyya  (1977)  discusses  nonparametric 
estimation  of  R  .  when  gn,jps  of  m  components  are  tested  under 

S  •  K 

independent  stresses  and  only  the  failure  count  is  recorded  for  each  group. 
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ON  THE  INTERPOLATION  OF  GRAVITY  ANOMALIES  AND 
DEFLECTIONS  OF  THE  VERTICAL  IN  MOUNTAINOUS  TERRAIN 


H.  Raussus  von  Luetzow 
U.S.  Army  Topographic  Laboratories 
Fort  Belvoir,  Virginia  22060 


ABSTRACT :  The  paper  first  addresses  the  interpolation  of  gravity  anomalies 
In  mountainous  terrain,  to  be  represented  as  the  sum  of  a  “signal"  variable 
with  a  quasl-statlonary  estimation  structure  and  a  computable  "noise"  variable 
without  a  stationary  character.  It  then  develops  the  particular  solution  of  the 
boundary  value  problem  of  physical  geodesy  which  permits  a  similar  representation 
of  deflections  of  the  vertical  and  draws  some  conclusions  concerning  the 
Inapplicability  of  Molodensky's  series  approach  and  of  the  collocation  method 
for  the  accurate  determination  of  vertical  deflections  from  unmodified  gravity 
anomalies  in  mountainous  terrain.  Thereafter,  It  discusses  the  estimation  of 
signal-type  deflections  of  the  vertical  by  means  of  spatial  covariance  functions, 
i.e.,  by  a  linear  regression  technique  called  statistical  collocation  in  physical 
geodesy,  and  provides  first  order  expansions  of  planar  covariance  functions. 


1.  INTRODUCTION.  Deflection  of  the  vertical  components  C  and  n  play  a 

role  in  the  adjustment  of  geodetic  networks,  in  the  computation  of  height  anomaly 
differences,  and  in  the  transformation  of  local  coordinates  into  terrestrial 
coordinates.  Short  of  a  three-dimensional  solution  of  the  geodetic  boundary  value 
problem  under  consideration  of  mountainous  terrain,  deflection  components  and  gravity 
anomalies  Ag  are  also  desirable  for  the  numerical  upward  continuation  of  the  first 
order  derivatives  of  the  anomalous  gravity  potential.  The  interpolation  or 
estimation  of  gravity  vector  components  In  flat  terrain  is  not  inherently  difficult. 
In  mountainous  terrain,  gravity  anomalies  Ag  are  profitably  modified  to  Faye 
anomalies  Agp  by  means  of  terrain  corrections  C,  to  be  followed  by  a  transformation 
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to  Bouguer  anomalies  Agg  which  permit  an  approximate  two-dimensional  Interpolation. 
Isostatic  gravity  anomalies  Ag^ ,  to  be  corrected  by  the  indirect  effect,  would 
require  a  three-dimensional  interpolation  technique  In  the  case  of  high  accuracy.  The 
problem  of  deflection  estimation  has  been  discussed  by  Heiskanen  and  Moritz  [1967]  ar 
others .including  the  method  by  Molodensky  et  a!.  [1962]  for  the  calculation  of  deflec 
differences  in  flat  terrain,  and  the  difficulty  to  interpolate  n  in  rough  moun¬ 
tainous  terrain.  Baussus  von  luetzow  [1980]  addressed  the  optimal  densi fication 
of  deflections  of  the  vertical  in  flat  terrain  with  and  without  consideration  of 
gravity  anomalies  and  extended  Molodensky's  approach.  Badekas  and  Mueller  [1968] 
utilized  Eotvos  torsion  balance  measurement  together  with  appropriate  terrain 
corrections  for  the  interpolation  of  vertical  deflections,  a  time-consuming  procedure 
and  soon  to  be  replaced  by  the  employment  of  moving  base  gravity  gradiometers . 
Regardless  of  these  efforts,  an  effective  C ,  ^-estimation  method  applicable  in  mounta 
terrain  will  still  be  valuable  and  may  also  aid  deflection  estimation  under 
consideration  of  a  series  of  discrete  inertial  measurements.  Section  2  of  this 
study  addresses  the  interpolation  of  gravity  anomalies  in  mountainous  terrain.  In 
section  3,  the  appropriate  solution  of  the  boundary  value  problem  for  vertical 
deflections  is  presented  and  reformulated  for  optimal  deflection  estimation  of - 
"signal"  components  of  6  and  n  and  computation  of  topographic  "noise"  terms.  The 
estimation  of  signal-type  components  by  means  of  spatial  col’ocatlon  and  the 
development  of  first  order  approximations  of  spatial  covariance  functions  is  the 
subject  of  section  4. 

2.  INTERPOLATION  OF  GRAVITY  ANOMALIES.  It  is  well  known  that  an  accurate 
analytical  representation  of  free-air  anomalies  in  pronounced  mountainous  terrain 
can  only  be  achieved  by  a  polynomial  of  high  degree  by  means  of  Ag-data  available  in 
a  network  of  high  resolution.  As  a  consequence,  satisfactory  linear  interpolation 
requires  small  mesh  sizes  ax.  Ay.  The  following  modified  anomalies  have  been 
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useful  for  geodetic  applications  and  the  purpose  of  Interpolation: 


Agp  •  Ag  +  C 


O) 


where  C  is  the  terrain  correction.  Is  called  Faye  anomaly. 


agB  • 


gp  -  bh  + 


3«T 

2R 


(2) 


Is  the  modified  Bouguer  or  complete  topographic  anomaly  where  b  »  0.112  mgalnf^ 
is  the  Bouguer  gradient,  h  is  the  elevation  of  terrain,  AT  is  the  potential 
of  topographic  masses,  and  R  *  6371  Km  Is  the  earth's  mean  radius. 


Ag1  *  AgB  +  Cj  +  a5c  *  Ag  +  C  -  bh  +  Cf  +  a5c  +  r  (3) 

Is  the  Isostatic  anomaly  valid  for  the  compensated  geoid  with  a  •  0.3086  mgal  m“^ , 
A;  as  the  vertical  separation  between  geoid  and  cogeold,  and  r  as  a  random  error. 
Equation  (3)  may  be  further  written  as 

Ag  *  Ag|  +  Ct  +  r  (4) 

where  C*  represents  the  aggregate  of  terms  computable  from  the  known  topography. 

In  a  more  general  form,  also  applicable  to  the  optimal  estimation  of  vertical 
deflections,  equation  (4)  Is  reformulated  as 

m  *  s  +  n  +  r  (5) 


In  this  equation,  m  Is  a  "message"  variable,  s  Is  a  "signal"  variable,  n  Is 
deterministic  or  computable  "noise,"  and  r  Is  random- type  noise. 

Under  consideration  of  a  linear  signal  estimation  structure,  a  signal  can 
then  be  optimally  estimated  as 

se  -  L(mi  -  n1  -  (6) 

where  L  denotes  a  linear  operator  and  the  subscripts  e  and  1  refer  to  the 
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estimation  point  Pe  and  measurement  points  P^,  respectively.  The  optimal 
measurement  at  Pe  results  as 

me  *  sg  +  ne  +  re  »  L(mj  -  n^)  +  re  -  Lfr^)  +  ne  (7) 

The  estimation  error  is 

#  * 

e(me)  *  e(se)  +  e[re  -  L(r^ ) J  (8) 

The  corresponding  estimation  error  resulting  from  the  utilization  of  topographically 
unmodified  measurements  m^  Is 

e(me)  =  se  -  L{sj)  +  re  -  L(ri)  <■  ne  -  L(ni )  ^ 

=  e(se)  +  e[re  -  L(ri ) ]  +  ne  -  LCni) 

Comparison  of  equation  (9)  with  equation  (8)  shows  that  the  non-optlmal  Interpolate 
process  is  associated  with  a  "topographic"  estimation  error  ne  -  LCnt)  which 
becomes  In  general  Intolerable  In  moderate  to  rough  mountainous  terratn  and  thus 

Induces  the  requirement  of  a  fine  mesh  data  grid. 

The  Interpolation  of  isostatic  anomalies  by  means  of  spatial  collocation  will 
be  treated  In  conjunction  with  the  Interpolation  of  isostatic  deflections  of  the 
vertical  In  section  4, 

3.  formulation  of  a  vertical  deflection  solution  suitable  for  optimal 

INTERPOLATION.  Gravimetric-topographic  solutions  for  the  anomalous  gravity  potentla 
and  deflections  of  the  vertical  which  inherently  permit  a  "signal-noise"  separation 
according  to  equation  (5)  have  been  established  by  Pelllnen  [1969],  Moritz  [1969]  , 
and  Baussus  von  Luetzow  [1971].  The  latter  emphasized  that  these  essentially 
Identical  solutions  are  almost  equivalent  to  those  of  Molodensky  et  al .  [1962)  and  Br 
[  1964  3*  but  are  1ess  data  dependent,  more  direct  from  the  computational  view,  and 
more  advantageous  for  the  utilization  of  artificial  satellite  data.  The  notations  tc 


552 


be  used  are  the  following: 

-l 

;  e  T.y 
T 


■y 


s  * 

n  * 


h-const. 
h* const . 


81=  arc 


9h_ 

3x 


6 


2 


arc 


Oh 

3y 


3  3 

3 x"  *  3y 


G 


a 


S(*) 

k  =  e^.lO'Wg^sec'2 
p  =  2.67  g  cm'3 
R  =  6371  Km 
10  =  2R  sin  | 

1  »  (r^  +  Tp  -  2rArp  cos  *) 2 

o 

g 

Ag  =  g-y 
C 

Agp  =  Ag  +  C 


height  anomaly 

anomalous  gravity  potential 

normal  gravity 

prims  vertical  deflection 

meridian  vertical  deflection 

elevation  of  terrain  referring  to  moving  point  P 

elevation  of  terrain  referring  to  fixed  computation 
point  A 

northern  terrain  inclination 

eastern  terrain  inclination 

derivatives  taken  along  the  local  horizon  in  a 
northern  and  eastern  direction 

global  mean  gravity 

azimuth  angle  counted  clockwise  from  north 

angle  between  the  radius  vectors  rA  and  rp 
originating  at  the  earth's  spherical  center 

Stokes'  function 

gravitational  constant 

standard  density 

earth's  mean  radius 

see  Figure  1 

see  Figure  1 

unit  sphere  (full  solid  angle) 
measured  gravity 
gravity  anomaly 
terrain  correction 
Faye  anomaly 
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Bouguer  gradient 

potential  of  topographic  masses 


b  =  0.112  mgal  m"1 
6T 

AgB  =  AgF  -  bh  ♦  modified  Bouguer  or  complete  topographic 

anomaly 

The  geometry  involving  hA,  hahp,  R,  10,  and  1  Is  evident  from  Figure  1  below. 


Figure  1 

The  established  first  order  solution  for  the  deflection  components  is 


a 
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It  should  be  noted  that  6g  Is  In  general  very  small  and  that  the  computation 
of  ,  and  6n  requires  only  Integrations  over  4r.  It  has  further  to 

be  emphasized  that,  according  to  Baarda  [1979],  the  Inclination  angles  and 
&2  should  not  exceed  70°. 

Equation  (10)  Is  now  reformulated  under  consideration  of 

A9p  s  Ag  ♦  (agp  -  Ag)  •  Ag  ♦  6g2  (14) 

AgB  =  Ag  +  (AgB  -  Ag)  *  Ag  +  dg3  05) 


In  these  equations,  Ag  Is  a  signal  variable,  profitably  the  isostat^c  anomaly 
defined  in  equation  (3).  In  comparison  with  sg2 .  $g3  is  a  relatively  smooth 
topographic  quantity. 

The  substitutions  (14)  and  (15)  transform  equation  (10)  Into 


R  ffr.%  .  r  ,  ( cosal  dS.(*j  .  Ag  +  Gi(Ag)  Itn  BU 

+  Gl-  ) s i na (  d*  *  - G -  \tn  S2I 

♦&//[**  *  **  *  «•  <‘*>1  .(Sj)  (,6) 


The  first  two  terms  of  equation  (16),  Involving  the  anomaly  Ag,  represent  the 
"signal"  components  of  £  and  n.  The  following  three  terms  constitute  computable 
topographic  "noise."  Permitting  for  random-type  errors  r  and  r^,  equation  (16) 
can  be  written  in  analogy  with  equations  (4)  and  (5)  as 


m  s 

+  /6M 

,  +  1 

Inf  (r 

if  l  J 

\ 

(17) 


The  numerical  determination  of  the  three  topographic  terms  of  equation  (16) 
Is  a  complex  task,  which  can,  however,  be  accomplished  without  inherent 
difficulties  by  means  of  high-speed  computers.  In  this  respect,  the  Integration 
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area  relating  to  the  first  topographic  tern  can  be  considerably  restricted.  It 
appears  that  the  last  two  topographic  terms  are  particularly  subject  to  rapid 
changes  In  mountainous  terrain.  Accurate  Interpolation  Is  further  favored  If 
given  and  estimated  deflections  refer  to  points  associated  with  small  terrain 
Inclinations. 

In  accordance  with  Moritz  (1969,1,  the  second  order  correction  for  the 
height  anomaly  Is 

=  57  ff  +  <sg3)  S(ip)do  -  37  J j  Ug  +  6g3 )  do 


where 


G2  =  §7  /_/ G] (Ag  +  6g3)do  +  (Ay  +  6g3)  tn7^ 


Here,  6m  represents  the  maximal  terrain  Inclination. 
The  second  order  deflection  corrections  are  then 


_  <5C<7)(Ag)}  R__  ffr  ,  -  V  )COSal  dS(u>) 

(Ag)  f  4n  J  J  12  * 3  }  Slna  f  dip 


Mi  l  COSa  \  da 

4  il  /J  93  In4  (Smaf  dtp 


Designating  the  Integral  terms  of  equation  (20)  as  second  order  topographic 
corrections  and  6n^2\  equation  (17)  assumes  the  modified  form 


«<»)<  <»n  <  UnWi  \  rt»l 


Higher  order  topographic  correction  terms  are  not  warranted  because 
of  a  decreasing  convergence  radius  In  connection  with  higher  derivatives,  the 
assumption  of  a  standard  density  or  density  uncertainties,  respectively,  and 
Imperfect  Isostatic  equilibrium.  The  structure  of  equation  (21)  clearly  exhibits 
the  fact  that  a  highly  accurate  computation  of  and  cannot  be  achieved 
by  the  exclusive  utilization  of  free'air  anomalies  Ag.  For  the  same  reason.  Iterative 
solutions  of  the  integral  equations  for  generalized  surface  densities  by  Molodensky 
et  al .  [1962J  and  Brovar  [1964]  and  the  series  solution  by  Molodensky  et  al  .  [1962] 

in  general  do  not  converge  in  mountainous  terrain.  The  latter  permits  for 
auxiliary  boundary  surfaces  under  utilization  of  a  shrinking  parameter  k  5!o  and  thus 

Implies  the  possibility  of  analytical  continuation  with  p=0.  For  the  same  reason, 
collocation  solutions  would  only  satisfactorily  apply  with  respect  to  signal  variables 
i  and  n.  The  analytical  upward  continuation  if  the  first  derivatives  of  the  anomalous 
gravity  potential  In  mountainous  terrain  would  require  a  supplemental  approach. 

4.  SIGNAL  ESTIMATION  BY  STATISTICAL  COLLOCATION  AND  FIRST  ORDER  EXPANSIONS 
OF  PLANAR  COVARIANCE  FUNCTIONS.  As  indicated  by  Baussus  von  Luetzow  [1980], 
deflection  differences  in  flat  terrain  may  be  advantageously  determined  by  a 
combination  of  statistical  collocation  and  Vening  llelnesz  formulae  provided 
gravity  anomalies  are  also  available  in  sufficient  density  within  a  limited 
region.  Four  point  deflection  estimation  errors  with  mesh  sizes  ax  *  5  km, 

8  km,  and  24  km  were  found  to  be,  respectively,  of  the  order  0.1  arcsec,  0.2 
arcsec,  and  1.0  arcsec  In  the  case  of  estimators  free  of  errors.  Astrogeodetlcally 
determined  deflections  are,  however,  presently  associated  with  errors  of  the 
order  of  0.25  arcsec.  In  accordance  herewith,  it  is  advantageous  to  employ  a 
relatively  great  number  of  estimators  if  this  Is  feasible. 

The  signal  variable  to  be  estimated  and  representing  either  c  or  n  may  be 


xg,  and  the  estimators  may  be  written  xt  with  as  a  correlated  measurement 
error  Independent  of  x^  Under  the  assumption  of  an  existing  signal  and  noise 
covariance  structure  the  following  linear  regression  equation  can  be  formulated: 

*«  •£»1('x1  ♦  «t)  ■  AjIXj  *it) 

It  is  then  In  matrix  form,  with  bars .Indicating  covariances. 


*A  *  VMk 


vk> 


AiNik 


*  •  *  i  n 


(23) 


The  solution  for  the  regression  coefficient  matrix  follows  as 

Ai  *  *e  *K  NiK  1  (24) 

In  the  case  of  given  astrogeodetlc  vertical  deflections,  ^  may  be  composed 

2 

of  astrogeodetlc  errors  with  a  variance  (0.25  arcsec)  and  a  correlated  error 
partially  caused  by  imperfect  Isostatic  equilibrium- 

With  respect  to  the  basis  for  the  statistical  collocation  approach  In 

physical  geodesy,  reference  Is  made  to  Bjerhammar  (1973],  Grafarend  [1973], 

Krarup  [1969],  Laurltzen  [1971],  Moritz  (1970],  and  Tscherning  (1973).  Of 
significance  is  that  the  spatial  covariance  function  for  the  disturbing  gravity 
potential  has  to  satisfy  Laplace's  equation.  Baussus  von  luetzow  (1973] 
emphasized  the  necessity  to  treat  c-t  as  a  correlated  random  variable  where  ;  is 
a  deterministic  development  of  r  In  spherical  harmonics  of  at  least  degree  and 
order  15.  In  accordance  herewith,  the  requirement,  of  homogeneity  prescribes  and 
at  least  permits  In  practice  a  restriction  to  the  planar  approach  In  physical 
geodesy.  Accordingly,  yy  3  -Ag ,  yy,  and  —  are  supposed  to  satisfy  Laplace's 
equation.  It  Is  realized  that  the  convenient  requirements  of  homogeneity 
and  quasi-flat  terrain  are  only  approximately  satisfied. 
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Moritz  11976]  and  Nash  and  Jordan  [1978]  established  specific  T-covarlance 
functions  which  can  be  expanded  Into  space  In  a  closed  form.  As  has  been 
shown  by  the  latter  authors,  the  spatial  covariance  function  Is 

*TT(r,  Z\ ,  z2)  =  /  pF(p)e  '  1  ^  Jo(rp)dp 


(25) 


00 

F(p )  =  f  r^rJOo^ydr 

O 


(26) 


In  equations  (25)  and  (26),  J0  Is  the  zero-order  Bessel  function,  r  Is  the 
variable  planar  distance,  Z\  and  z2  are  the  elevations  of  two  points,  and 
$TT  (r)  is  the  planar  T  -  covariance  function. 

The  spatial  vertical  deflection  covariances  may  be  derived  from 
equation  (25)  in  the  form 


(r,  zi,z2) 

*  4-nn  (r,  zi  ,z2 ) 
where  Yi  -y(z1),y2  sy(z2). 

For^jj-  functions  which  permit  the  derivation  of  realistic  vertical  deflection 
covariance  functions,  the  Hankel  transforms  (26)  and  (25)  cannot  be  evaluated 
in  closed  form.  As  an  example,  Jordan's  [1972]  third-order  Markov  model 


(YiY2) 


-jirj 

3y2 


TT ( r  ) 


(27) 


*TT(r)  *  var  T  (1  +  ^  + 


£ 

D 


(28) 


leads  to  the  hypergeometric  function  when  Introduced  in  equation  (26).  Thereafter, 
$TT(r,  Zi,  Z-2 )  only  can  be  obtained  by  an  extremely  lengthy  numerical  Integration, 
For  this  reason,  it  appears  to  be  advantageous  to  develop  first  order  approximations 
for  spatial  vertical  deflection  covariance  functions  under  consideration  of  Jordan's 
[1972]  planar  results.  In  this  respect  It  has  to  be  emphasized  that  Jordan 
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Interchanged  the  conventional  partial  differentiations 

iix  dy 


z 


Figure  2 


Under  consideration  of  Figure  2,  It  is  to  the  first  order 

si  a  so  +  (fi)^  ;  s2  -  ce  ♦  (fr)ez2 

It  is  then,  to  the  first  order, 

‘'.If0-2!  ■  W'ofjfl2,  ‘  2;> 

It  is  further  at  level  z  =  0 
iS.  _  _  3_ 

3Z  3y  3 Z  3y  G  G  3y 
so  that 


K 


Ms 

0  3  z 


“So 


1  3AgV 
5"  sy 


1  3 - 

Z  37  So^e 


Under  consideration  of 

S^ge  -  -  h(r)  l 

it  is 


3_ 

ay 


0(r}f  ] 


3h 

Fr  cos 


(29) 


(30) 


(31) 


(32) 


(33) 


(34) 
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The  final  results  are  hereafter 

Vi  ('••«.  2!>=V^(7s,n2aVcos!“)(2>  (35) 

*>•  z2^  **nn  ^  cos2a  s1n2a)Ul  *  *) __  _  (36) 

where  4>^  and  $nn  represent  the  planar  covariance  functions  c0Ce  and  n0ne> 

respectively,  and  where  cg-  oAg-(var6g)Ji. 

In  analogy  with  equation  (30),  It  Is 

-3i  Ag2  =  Ag0  Age  +  Ag0  ||Se-  {zx  +  z?)  (37) 

Under  utilization  of  the  planar  approximation 


Ma  =  -G(i&-  +  is.) 

3  z  v3y  3x' 


It  Is 


-GA9n(|^-  +  |^)  =  *G(~^o^e+  T7A9one) 


e  3X 


With  the  aid  of  equation  (33),  equation  (39)  can  be  formulated  as 


3?e  3ri 


3 a  a-  h  3h 


-Gao  ( —  +  — e)=  -G  — tL-2-  (-  +  — ) 
-o'ay  3x  1  vT  vr  3r; 


According ly, 


■  _  .  .  3Gq  rO  n  ,h  3h,  ,  .  /«,\ 

A9j  %  (r,  Zj.  z2)  «  «gg  -  (f  +  g^r)  (2,  +  z2)  (41) 

where  *gg  -  Ag^g^. 

It  Is  evident  from  equations  (35),  (36),  and  (41)  that  these  represent  convenient 
closed  approximations  of  the  three  spatial  covariance  functions  of  particular 
Interest.  In  general,  the  planar  covariance  functions  should  apply  to  the  lowest 
z-level  In  a  particular  area  of  application. 
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5.  CONCLUSION.  The  Immediate  accurate  Interpolation  of  gravity  anomalies 
and  deflections  of  the  vertical  In  mountainous  terrain  Is  only  possible  from 
data  provided  In  a  grid  of  high  resolution.  Optimal  interpolation  from  data 
given  at  points  separated  by  distances  of  the  order  5-10  km  or  from  multiple 
data  Incorporating  measurement  noise  with  shorter  spacing  can  be  accomplished 
by  an  appropriate  representation  of  gravity  anomalies  and  deflections  as  a 
signal-noise  process  with  nonstationary  noise  computable  from  the  earth's 
topography.  In  the  case  of  deflections,  a  special  solution  of  the  geodetic 
boundary  value  problem  is  required.  As  a  first  approximation,  Faye  anomalies 
may  be  used  as  signal  variables.  Isostatic  anomalies  modified  by  the  indirect 
effect  provide  a  greater  degree  of  homogeneity  and  isotropy.  Implementation 
of  the  theory  requires  the  utilization  of  existing  "isostatic"  computer  programs 
and  the  establishment  of  a  supplemental  program  under  consideration  of 
furnished  analytical  solutions.  Signal  estimation  has  to  be  facilitated  by 
the  use  of  spatial  covariance  functions  first  order  approximations  of  which 
may  be  computed  relatively  easy  from  planar  covariance  expressions.  The 
optimal  interpolation  method  In  conjunction  with  the  special  solution  for 
deflections  Indicates  that  Iterative  or  series  solutions  of  the  boundary 
value  problem  of  physical  geodesy  cannot  be  expected  to  converge  In  mountainous 
terrain.  The  method  developed  is  of  practical  significance  for  the 
densl flcatlon  of  gravity  anomaly  and  astrogeodetic  deflection  networks  In 
mountainous  terrain  and  Is  also  valuable  or  indispensable,  respectively,  for 
the  optimal  estimation  of  gravity  anomalies  and  deflections  from  astrogeodetic 
and  Inertial  data  In  mountainous  areas. 
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A  Sequential  k-Group  Random  Allocation  Method 
with  Applications  to  Simulation 

* 

Andrew  P.  Soms 


Abstract 

A  sequential  method  of  random  allocation  is  giv.n  and  it  is 
shown  how  it  can  he  used  to  estimate  the  observed  siqnificancc 
levels  of  k-sample  nonparametric  tests.  The  sequential  technique 
is  compared  to  the  standard  random  allocation  technique  and 
shown  to  be  more  efficient.  M.  application  is  made  to  the  Dunn- 
Bonf erroni  method  of  multiple  comparisons. 
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*|  1.  The  Sequential  Allocation  Method 

• 

-  r 

l 

t  Bebbington  (1975)  showed  that  if  there  were  N  objects  (such 

t 

r, 

i  ■ 

t 

n 

i  as  file  cards)  from  which  it  was  desired  to  select  (without 

replacement  here  and  throughout)  a  random  sample  of  size  k  without 

i 

r 

i 

i 

numbering  the  N  objects,  then  one  could  proceed  sequentially  by 

l 

c 

,  !  selecting  the  first  object  with  probability  k/N  and  if  at  the  T~ 

i 

;  !  s  t  1 

1  stage  s  have  been  selected,  then  the  T+l—  object  is  selected  with  : 

> 

• 

probability  (k-s)/(N-T),  T  = 1,2, . . . ,N-1. 

We  now  state  and  prove  the  extension  to  an  arbitrary  number 

\ 

of  groups.  Suppose  there  are  N  objects  and  it  is  desired  to 

I 

sequentially  divide  them  randomly  into  r  groups  of  size 
r 

k.,k~,...,k  ,  £  k.  *  N,  i.e.,  each  allocation  has  probability 

A  *  r  i=l  A 

A  N 

1/  k  _ _ k  .  Let  siT»**-*srT  be  the  number  of  objects  selected 

for  groups  l,2,...,r  at  the  T—  stage  and  let  P^  T+^  denote  the 

s  t 

selection  probability  for  group  i  at  the  T+l^  stage.  Then  if 

f  ■ 

Pi,T+l  =  (ki"SiT)/(N'T)  ,  T  =  0,1,...,N-1  ,  (1.1) 

r 

the  selection  is  random.  Note  that  P.  .  *  k./N  and  T  P.  .  = 1. 

1,X  1  i=sl 

;  The  randomness  follows  immediately  by  noting  that  the  probability 

;  of  a  particular  assignment  is 

s 

ULv]/"' -  x/(kl..N..kr]  • 

t 

l 

^  Bebbington’ s  (1975)  result  is  a  special  case  of  the  above 

when  r  =  2. 

i 

r  | 

As  an  example,  suppose  r  «*  3,  k^  =  2,  k2  *  2,  k^  «=  3  and  N  =  7. 

In  order  to  make  the  sequential  allocation  given  by  (1.1)  we  take 

t: 

i 

[ 

k 

f 
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I 

1  ^  independent  random  numbers  i  e  1,2,..., 7.  Lot 

S  i 

®0,T  =  ®  anc*  T  “  ^  Pn  T'  *  “  i#2 ,  .  .  .  ,r ,  T  «  1,2,.. ,,N. 

i  '  j=l 

r  ^en  the  m  object,  m  =  1,2,..., N,  is  assigned  to  group  n,  where 
n  is  the  unique  integer  such  that 

|  C>  .  <  U  <  Q 

;  n-l,m  m  —  n,m 

!  Suppose  the  7  random  numbers  ar«?  ...79039 ,  .01850,  .99744,  .81812, 

.93169,  .22705,  and  .97709.  The  selection  process  is  summarized 
in  Table  1. 


1.  Selection  Process 


Stage 

1 

2 

3 

4 

5 

6 
7 


Random 

Digit 

P1T 

P2T 

P3T 

.79039 

2/7 

2/7 

3/7 

.01850 

2/6 

2/6 

2/6 

.99744 

1/5 

2/5 

2/5 

.81812 

1/4 

2/4 

1/4 

.93169 

1/3 

2/3 

0 

.22705 

1/2 

1/2 

0 

.97709 

0 

1 

0 

Group 

Selected 


Note  that  if  all  the  k^'s  are  one,  a  random  permutation  is 
produced  if  we  think  of  the  group  as  denoting  position. 


7  independent  random  numbers  lb  ,  i  c  1,2,... ,7.  Let 


°0,T 

=  0  and  Qi>T  c 

I  P,  T,  i  - 

j=l  2,T 

r,  T  =  1, 

2 , . . . ,N . 

Then  the 

m*"*1  object,  m  = 

1,2,. «.,N,  is 

assigned 

to  group 

n,  where 

n  is  the 

unique  integer 

such  that 

0  ,  < 
n-l,m 

u  <  o 

m  —  n,m 

Suppose  the  7  random  numbers  arq... 79039,  .01850 

,  .99744, 

.81812, 

.93169,  . 

22705,  and  .97709.  The  selection  process  is  summarized 

in  Table 

1. 

1 

.  Selection  Process 

Staqe 

Random 

Digit 

P1T 

P2T 

P3T 

Group 

Selected 

1 

.79039 

2/7 

2/7 

3/7 

3 

2 

.01850 

2/6 

2/6 

2/6 

1 

3 

.99744 

1/5 

2/5 

2/5 

3 

4 

.81812 

1/4 

2/4 

1/4 

3 

5 

.93169 

1/3 

2/3 

0 

2 

6 

.22705 

1/2 

1/2 

0 

1 

7 

.97709 

0 

1 

0 

2 

Note  that  if  all  the  k^s  are  one,  a  random  permutation  is 

produced 

if  we  think  of 

the  group  as 

denoting 

position. 
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2.  Applications  to  Simulation 


In  k-sample  nonparametric  tests  the  observed  significance 
level  of  the  test  is  obtained  by  considering  all  possible  parti¬ 
tions  M  of  the  (possibly  tied)  observed  values  or  (possibly 
average)  ranks  into  r  groups,  computing  the  value  of  the  test 
statistic,  and  counting  the  number  of  times  m  it  is  equal  to  or 
greater  than  the  observed  value.  The  observed  significance  level 
a  is  then  m/M.  When  the  number  of  partitions  is  large  this  is 
prohibitive  and  a  is  estimated  either  by  simulation  (taking  a 
large  random  sample  of  the  allocations)  or  by  asymptotics.  The 
advantage  of  simulation  is  that  one  can  control  the  accuracy  of 
the  estimate  (by  taking  a  large  or  small  random  sample)  depending 
on  the  importance  of  the  situation,  unlike  asymptotics  which  each 
time  it  is  used  forces  one  into  the  straight- jacket  of  committing 
a  usually  unknown  error.  Since  it  is  (perhaps  regrettably)  a  well 
known  fact  that  different  actions  will  be  taken  for  close  values 

of  a,  one  above  and  the  other  below  some  fixed  level  (e.g.,  .01, 
.05,  or  .1)  of  the  decision-maker,  the  use  of  simulation  at  least 
prevents  approximating  error  in  a  to  be  the  determining  factor. 

If  it  is  decided  to  use  simulation,  then  a  possible  procedure 
is  to  make  the  random  assignment  as  described  in  Section  1  many 
times  by  using  a  computer.  The  commonly  used  method  is  to  produce 
a  random  permutation  by  ordering  a  random  sample  of  uniform  num¬ 
bers  and  choosing  the  first  k^  indexes  for  group  1,  the  next  k2 
for  group  2,  and  so  on.  If  all  the  k^'s  are  one,  then  this  is 
more  efficient  than  Section  1.  However,  as  soon  as  the  k^'s 
depart  even  moderately  from  1,  the  method  of  Section  1  becomes 
much  more  efficient.  As  an  example,  if  -  k2  = k^ = k4  =  10  and  it 
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is  desired  to  make  2000  random  assignments  using  a  UNIVAC  1110  computer, 
a  FOkTRAN  program  using  the  methods  of  Section  1  uses  4.71  seconds  of 
CPU  time  while  a  FORTRAN  program  using  the  random  permutation  method 
takes  9.17  seconds. 

The  Appendix  contains  a  listing  of  the  FORTRAN  subroutine 
RANDM  that  uses  the  theory  of  Section  1  to  make  random  assignments. 

This  may  be  tied  in  with  any  specific  simulation  problem,  e.g., 
the  case  treated  in  Section  3 . 


3.  Applications  to  the  Dunn-Bonf erroni  Method 
of  Multiple  Comparisons 


The  D-B  (Dunn-Bonf erroni)  method  is  described  in  Dunn  (1964)  . 
Briefly,  letY^j,  i  =  l,2,...,r,  j  «  l,2,...,n.,  be  continuous  (this 
assumption  is  not  important  and  is  removed  later)  random  variables 
with  distribution  function  F^ ,  H^:  F^  =F2  =  ...  sFf(  Ha:  for  at 
least  one  pair  (i,j),  f  F ^  £Ln  the  sense  of  producing  larger  or 
smaller  values) ,  and  the  test  must  identify  which,  if  any,  pairs 
are  different.  Denote  by  the  upper  a—'  point  of  the  standard 
normal.  The  D-B  test  declares  all  those  pairs  (i,j),  i<  j, 
different  for  which 


(N)  (N+l) 
12 


za/(k(k-l) ) 


(3.1) 


where  R^  denotes  the  average  of  the  ranks  of  the  i—  group  in  the 
joint  ranking.  The  nominal  significance  level  of  this  procedure 
is  o.  The  actual  significance  level  is 

a.  «  PA  Max  2 

A  °U<j 


ij  * 


z  /  (k (k-1) ) 
a 


(3.2) 


and  may  be  obtained  by  simulation  based  on  Section  1.  Table  2 
gives  some  comparisons  of  nominal  with  actual,  using  Section  1  and 
10,000  simulations. 
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2.  Compar icon  of  Actual  to  Nominal  a 


r 

2oriv.no  n 

C  ouu  Size 

Nominal  a 

Actual  a 

3 

5 

.05 

.037 

3 

30 

.05 

.040 

3 

15 

.05 

.043 

3 

30 

.05 

.045 

3 

5 

.01 

.0030 

3 

10 

.01 

.0077 

3 

15 

.01 

.0077 

3 

30 

.01 

.010 

5 

5 

.05 

.026 

5 

10 

.05 

.036 

5 

5 

.01 

.0030 

5 

10 

.01 

.0067 

The  Appendix  contains  a  listing  of  the  program  used  for  Table  2  . 
It  thus  appears  that  D-B  is  conservative  and  we  can  remove  the 
conservatism  by  substituting  for  zQ/(k(k-l))  ,  where  d^j, 

l,2,,..,r(r-l)/2,  is  the  i—  largest  observed  values  of  Z^, 
i<j,  to  obtain  by  simulation  the  r(r-l)/2  possible  observed 
significance  levels. 

The  K-S  (Kruskal-Schef f e)  method  is  also  sometimes  used  in 
this  situation  (see,  e.g.,  Miller,  1966,  p.  166)  and  consists  of 

replacing  z  /(r(r-l))  in  (3.1)  with  h^2  >=  (x2  _  .J^2,  where 

2  tVi  2 

X  ,  is  the  upper  a—  point  of  x  with  r-1  degrees  of  freedom. 

oi  t  r*'  i 

The  comparison  of  the  critical  constants  in  Table  3  shows  that 

this  is  even  more  conservative  than  D-B. 
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3.  Comparison  of  D-B  and  K-C  Critical  Constants 


r 

2.os/(r(r‘1,) 

(*2  >1/2 
u.  05;  r-1 

*.oi/(r(r'ln 

3 

2.39 

2.79 

2.94 

3.36 

4 

2.50 

3.08 

3.02 

3.65 

5 

2.58 

3.33 

3.09 

3.89 

6 

2.64 

3.55. 

3.15 

4.10 

7 

2.69 

3.75 

3.19 

4.30 

8 

2.74 

3.94 

3.23 

4.48 

9 

2.77 

4.11 

3.26 

4.66 

10 

2.81 

4.28 

3.29 

4.82 

1/2 


If"  the  data  is  discrete,  the  D-B  method  can  be  modified  as 
in  Dunn  (1964)  and  the  random  assignment  done  on  average  ranks. 
Thus  ties  present  no  problems  in  this  approach. 

The  third  method  discussed  in  Miller  (1964) ,  the  Steel  many- 
one  rank  statistics,  is  too  time-consuming  for  the  simulation 
approach.  For  all  practical  purposes  the  exact  D-B  (use  of  the 
d^j  and  simulation)  seems  the  best  method  to  use. 
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Appendix 


*  1  c  fXAMrtf  nr  'UTN  PROGRAM  FOR  SrOUf  MT  I  At  RAnAC*  ALLOCATIONS 
r  :  c  *|R  IS  T’<f  MUMPER  OF  r, ROMPS,  »j$TM  THP  NliHRfcR  OF  SIMULATIONS,  K'S  THE  GRO"» 
■c— 5t-7eS,-XAS~TNF--^»tHHpRS  -TO-RF-ALL-OC  A  TFO - - - - 

dimension  x(ioo.<n#Z(?n,mo)fK(2o) 

10«  FORMAT  (  ) 

►  _  _ _ 


2  CONTINUE 


GO  TO  99 


END 

•FOR, IS  .SUBI 

- St*«*R-PttH*tE-  S*-»:f)--»f-M»7K,KK,  xf  Z) - - - 

c  NP  Nliy.RE P  OF  GROUPS,  K  apdAy  OF  GROUP  SIZES,  KK  nii-RER  OF  ELEMENTS,  X  ART 

c  OF  KK  El  EVENTS,  Z  Random  AU.  OC  a  T I  ON  OF  x 

- XiK-OV^Z  ("2<*v-HK»T  '<-Wrgfe»» ITftfr)  yK-ffr-H - 

00  3  l?=1  ,  NR 
t  NSC(T2)=0 

- gO  -IR  Vi  gg-t-r** - * - 

333  U(I2IsRAUUNCX) 

00  <4  I  3=1  ,  NR 

- rccn)g^ - 

fl  0(n)’=K(I3) 

SAXI=nR-1 

- po  - 

00  5  III=1,II 

5  oc(in=ocnn+Q(Tin 

- FH>..  i,  -f -total  ,-KV-  - 

ll(IU)sCKK-  tU+  I  . 

IF  (U(I4).  .F.'ld))  GO  TO  61 


7  TF  CIJ  ( 141. LF.OC  CIST)  GO  Tfl  hi 
IF  (U(T«).GT.OCfNR-n )  IN0EX  =  MP 

- GO-T-O  -64 - 

61  IN0E*=1 
CO  TO  64 


!~«  SZcU'V*  «t  ..  TO  « 

C  *LC  '-'L  A  TF  n ,  K*S  A°F  THF.  IJPni'P  SIZES  _ _  _ _  — — 

f)^Mf  »sios  X(  lOOM  •  ?  (2n#  I  i>,*>  *  *  C?0> 

jpn  rpPwAT  (  1  _ __ _ -  _ - - 


IF  (HR.E3.  PI  r.n  in  101 

MiXIzN*-! 

R[n  -t  4-) ,  Z*  L~ — - — — 

PEAfl  too,  (<(  JI  #  J=1#MR1 


1  KKSK«+X ( J) 
nn  203  Islf^K 


CALL  RA'JOMtMrt,  K,KK,  X,  Z) 

no  20  vHs|  ,?iR 


»JiJPPsK  C  J 1  I 
PO  20  J2=1,»»JPP 


jPRon=o 

no  22  J1=1»MAXI 


2  CONTINUE 

PRsCnUA<T/*iSI--A 


PRIM  100,ZAL»NSIM,PR 
GO  TO  9*> 
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